DOCKET NO. 6063.520-WO

INSULIN AND IGF-1 RECEPTOR AGONISTS AND ANTAGONISTS 

This application is a continuation-in-part of U.S. Application Serial No. 
09/538,038 filed September 24, 2002, which is a continuation-in-part of U.S. 
Application Serial No. 09/538,038 filed March 29, 2000, which is a
continuation-in-part of U.S. Application Serial No. 09/146,127, filed 
September 2, 1998, both of which are incorporated by reference in their 
entirety. 

I. FIELD OF THE INVENTION 

This invention relates to the field of hormone receptor activation or
inhibition. More specifically, this invention relates to the identification of
molecular structures, especially peptides, which are capable of acting at
either the insulin or insulin-like growth factor receptors as agonists or
antagonists. Also related to this invention is the field of molecular modeling
whereby useful molecular models are derived from known structures.

II. BACKGROUND OF THE INVENTION 

Insulin is a potent metabolic and growth promoting hormone that acts 
on cells to stimulate glucose, protein, and lipid metabolism, as well as RNA 
and DNA synthesis. A well-known effect of insulin is the regulation of
glucose levels in the body. This effect occurs predominantly in liver, fat, and
muscle tissue. In the liver, insulin stimulates glucose incorporation into 
glycogen and inhibits the production of glucose. In muscle and fat tissue, 
insulin stimulates glucose uptake, storage, and metabolism. Defects in 
glucose utilization are very common in the population, giving rise to
diabetes.

Insulin initiates signal transduction in target cells by binding to a 
specific cell-surface receptor, the insulin receptor (IR). The binding leads to 
conformational changes in the extracellular domain of IR, which are 
transmitted across the cell membrane and result in activation of the 
receptor's tyrosine kinase activity. This, in turn, leads to
autophosphorylation of the tyrosine kinase of IR, and the binding of soluble
effector molecules that contain SH2 domains, such as phosphoinositol-3-kinase,
Ras GTPase-activating protein, and phospholipase Cγ, to IR (Lee
and Pilch, 1994, Am. J. Physiol. 266:C319-C334).

Insulin-like growth factor 1 (IGF-1) is a small, single-chain protein
(MW = 7,500 Da) that is involved in many aspects of tissue growth and
repair. Recently, IGF-1 has been implicated in various forms of cancer,
including prostate, breast, colorectal, and ovarian cancers. It is similar in
size, sequence, and structure to insulin, but has 100-1,000-fold lower affinity
for IR (Mynarcik et al., 1997, J. Biol. Chem. 272:18650-18655).

Clinically, recombinant human IGF-1 has been investigated for the
treatment of several diseases, including type I diabetes (Carroll et al., 1997,
Diabetes 46:1453-1458; Crowne et al., 1998, Metabolism 47:31-38),
amyotrophic lateral sclerosis (Lai et al., 1997, Neurology 49:1621-1630), and
diabetic motor neuropathy (Apfel and Kessler, 1996, CIBA Found. Symp.
196:98-108). Other potential therapeutic applications of IGF-1, such as
osteoporosis (Canalis, 1997, Bone 21:215-216), immune modulation (Clark,
1997, Endocr. Rev. 18:157-179) and nephrotic syndrome (Feld and
Hirshberg, 1996, Pediatr. Nephrol. 10:355-358), are also under
investigation.

A number of studies have analyzed the role of endogenous IGF-1 in
various disease states. Interestingly, several reports have shown that IGF-1
promotes the growth of normal and cancerous prostate cells both in vitro
and in vivo (Angelloz-Nicoud and Binoux, 1995, Endocrinol. 136:5485-5492;
Figueroa et al., 1995, J. Clin. Endocrinol. Metab. 80:3476-3482; Torring et
al., 1997, J. Urol. 158:222-227). Additionally, elevated serum IGF-1 levels
correlate with increased risks of prostate cancer, and may be an earlier
predictor of cancer than is prostate-specific antigen (PSA) (Chan et al.,
1998, Science 279:563-566). Recent studies have indicated a connection
between IGF-1 levels and other cancers such as breast, colorectal, and
ovarian. Serum IGF-1 levels are regulated by the presence of IGF binding
proteins (IGFBP) which bind to IGF-1 and prevent its interaction with the
IGF-1 receptor (IGF-1R; reviewed in Conover, 1996, Endocr. J. 43S:S43-
S48 and Rajaram et al., 1997, Endocr. Rev. 18:801-831). Interestingly, PSA
has been shown to be a protease that cleaves IGFBP-3, resulting in an
increase of free IGF-1 in serum (Cohen et al., 1992, J. Clin. Endocrinol.
Metab. 75:1046-1053; Cohen et al., 1994, J. Endocrinol. 142:407-415; Lilja,
1995, J. Clin. Lab. Invest. Suppl. 220:47-56). Clearly, regulation of IGF-1R
activity can play an important role in several disease states, indicating that
there are potential clinical applications for both IGF-1 agonists and
antagonists.

IGF-1R and IR are related members of the tyrosine-kinase receptor
superfamily of growth factor receptors. Both types of receptors are
composed of two α and two β subunits which form a disulfide-linked
heterotetramer (β-α-α-β). These receptors have an extracellular ligand
binding domain, a single transmembrane domain, and a cytoplasmic domain
displaying the tyrosine kinase activity. The extracellular domain is
composed of the entire α subunits and a portion of the N-terminus of the β
subunits, while the intracellular portion of the β subunits contains the
tyrosine kinase domain. Another family member is insulin-related receptor
(IRR), for which no natural ligand is known.

While similar in structure, IGF-1R and IR serve different physiological
functions. IR is primarily involved in metabolic functions whereas IGF-1R
mediates growth and differentiation. However, both insulin and IGF-1 can
induce both mitogenic and metabolic effects. Whether each ligand elicits
both activities via its own receptor, or whether insulin exerts its mitogenic
effects through its weak affinity binding to IGF-1R, and IGF-1 its metabolic
effects through IR, remains controversial (De Meyts, 1994, Horm. Res.
42:152-169).

IR is a glycoprotein having a molecular weight of 350-400 kDa
(depending on the level of glycosylation). It is synthesized as a single
polypeptide chain and proteolytically cleaved to yield a disulfide-linked
monomer α-β insulin receptor. Two α-β monomers are linked by disulfide
bonds between the α-subunits to form a dimeric form of the receptor
(β-α-α-β-type configuration). The α subunit is comprised of 723 amino acids,
and it can be divided into two large homologous domains, L1 (amino acids 1-155)
and L2 (amino acids 313-468), separated by a cysteine rich region (amino
acids 156-312) (Ward et al., 1995, Prot. Struct. Funct. Genet. 22:141-153).
Many determinants of insulin binding seem to reside in the α-subunit. A
unique feature of IR is that it is dimeric in the absence of ligand.

The sequence of IR is highly homologous to the sequence of IGF-1R.
The sequence identity level varies from about 40% to 70%, depending on
the position within the α-subunit. The three-dimensional structures of both
receptors may therefore be similar. The crystal structure of the first three
domains of IGF-1R has been determined (Garrett et al., 1998, Nature
394:395-399). The L domains consist of a single-stranded right-handed
β-helix (a helical arrangement of β-strands), while the cysteine-rich region is
composed of eight disulfide-bonded modules.

The β-subunit of the insulin receptor has 620 amino acid residues
and three domains: extracellular, transmembrane, and cytosolic. The
extracellular domain is linked by disulfide bridges to the α-subunit. The
cytosolic domain includes the tyrosine kinase domain, the three-dimensional
structure of which has been solved (Hubbard et al., 1994, Nature 372:746-
754).

To aid in drug discovery efforts, a soluble form of a membrane-bound
receptor was constructed by replacing the transmembrane domain and the
intracellular domain of IR with constant domains from immunoglobulin Fc or
k subunits (Bass et al., 1996, J. Biol. Chem. 271:19367-19375). The
recombinant gene was expressed in human embryonic kidney 293 cells.
The expressed protein was a fully processed heterotetramer and the ability
to bind insulin was similar to that of the full-length holoreceptor.

IGF-1 and insulin competitively cross-react with IGF-1R and IR (L.
Schaffer, 1994, Eur. J. Biochem. 221:1127-1132). Despite 45% overall
amino acid identity, insulin and IGF-1 bind only weakly to each other's
receptor. The affinity of each peptide for the non-cognate receptor is about
3 orders of magnitude lower than that for the cognate receptor (Mynarcik et
al., 1997, J. Biol. Chem. 272:18650-18655). The differences in binding
affinities may be partly explained by the differences in amino acids and
unique domains which contribute to unique tertiary structures of ligands
(Blakesley et al., 1996, Cytokine Growth Factor Rev. 7(2):153-9).

Both insulin and IGF-1 are expressed as precursor proteins
comprising, among other regions, contiguous A, B, and C peptide regions,
with the C peptide being an intervening peptide connecting the A and B
peptides. A mature insulin molecule is composed of the A and B chains
connected by disulfide bonds, where the connecting C peptide has been
removed during post-translational processing. IGF-1 retains its smaller
C-peptide as well as a small D extension at the C-terminal end of the A chain,
making the mature IGF-1 slightly larger than insulin (Blakesley, 1996). The
C region of human IGF-1 appears to be required for high affinity binding to
IGF-1R (Pietrzkowski et al., 1992, Cancer Res. 52(23):6447-51).
Specifically, tyrosine 31 located within this region appears to be essential for
high affinity binding. Furthermore, deletion of the D domain of IGF-1
increased the affinity of the mutant IGF-1 for binding to the IR, while
decreasing its affinity for the IGF-1R (Pietrzkowski et al., 1992). A further
distinction between the two hormones is that, unlike insulin, IGF-1 has very
weak self-association and does not hexamerize (De Meyts, 1994).

The α-subunits, which contain the ligand binding region of IR and
IGF-1R, demonstrate between 47-67% overall amino acid identity. Three
general domains have been reported for both receptors from sequence
analysis of the α subunits, L1-Cys-rich-L2. The cysteine residues in the
C-rich region are highly conserved between the two receptors; however, the
cysteine-rich domains have only 48% overall amino acid identity.

Despite the similarities observed between these two receptors, the
roles of the domains in specific ligand binding are distinct. Through chimeric
receptor studies (domain swapping of the IR and IGF-1R α-subunits),
researchers have reported that the sites of interaction of the ligands with
their specific receptors differ (T. Kjeldsen et al., 1991, Proc. Natl. Acad. Sci.
USA 88:4404-4408; A.S. Andersen et al., 1992, J. Biol. Chem. 267:13681-
13686). For example, the cysteine-rich domain of the IGF-1R was
determined to be essential for high-affinity IGF binding, but not insulin
binding. When the region comprising amino acids 191-290 of IGF-1R was
introduced into the corresponding region of the IR (amino acids 198-300),
the modified IR bound both IGF-1 and insulin with high affinity. Conversely,
when the corresponding region of the IR was introduced into the IGF-1R, the
modified IGF-1R bound insulin but not IGF-1.

A further distinction between the binding regions of the IR and IGF-1R
is their differing dependence on the N-terminal and C-terminal regions.
Both the N-terminal and C-terminal regions (located within the putative L1
and L2 domains) of the IR are important for high-affinity insulin binding but
appear to have little effect on IGF-1 binding for either IR or IGF-1R.
Replacing residues in the N-terminus of IGF-1R (amino acids 1-62) with the
corresponding residues of IR (amino acids 1-68) confers insulin-binding
ability on IGF-1R. Within this region, residues Phe-39, Arg-41 and Pro-42
are reported as major contributors to the interaction with insulin (Williams et
al., 1995). When these residues are introduced into the equivalent site of
IGF-1R, the affinity for insulin is markedly increased, whereas substitution
of these residues by alanine in IR results in markedly decreased insulin
affinity. Similarly, the region between amino acids 704-717 of the
C-terminus of IR has been shown to play a major role in insulin specificity.
Substitution of these residues with alanine also disrupts insulin binding
(Mynarcik et al., 1996, J. Biol. Chem. 271(5):2439-42; C. Kristensen et al.,
1999, J. Biol. Chem. 274(52):37351-37356).



WO 03/070747 



PCT/US02/30312 



Further studies of alanine scanning of the receptors suggest that
insulin and IGF-1 may use some common contacts to bind to IGF-1R but
that those contacts differ from those that insulin utilizes to bind to IR
(Mynarcik et al., 1997). Hence, the data in the literature has led one
commentator to state that even though "the binding interfaces for insulin and
IGF-1 on their respective receptors may be homologous within this interface
the side chains which make actual contact and determine specificity may be
quite different between the two ligand-receptor systems" (De Meyts, 1994).

Based on data for binding of insulin and insulin analogs to various
insulin receptor constructs, a binding model has been proposed. This model
shows insulin receptor with two insulin binding sites that are positioned on
two different surfaces of the receptor molecule, such that each alpha-subunit
is involved in insulin binding. In this way, activation of the insulin receptor is
believed to involve cross-connection of the alpha-subunits by insulin. A
similar mechanism may operate for IGF-1R, but one of the receptor binding
interactions appears to be different (Schaffer, 1994, Eur. J. Biochem.
221:1127-1132).

The identification of molecular structures having a high degree of
specificity for one or the other receptor is important to developing efficacious
and safe therapeutics. For example, a molecule developed as an insulin
agonist should have little or no IGF-1 activity in order to avoid the mitogenic
activity of IGF-1 and a potential for facilitating neoplastic growth. It is
therefore important to determine whether insulin and IGF-1 share common
three-dimensional structures but have sufficient differences to confer
selectivity for their respective receptors. Similarly, it would be desirable to
identify other molecular structures that mimic the active binding regions of
insulin and/or IGF-1 and which impart selective agonist or antagonist
activity.

Although certain proteins are important drugs, their use as
therapeutics presents several difficult problems, including the high cost of
production and formulation, administration usually via injection and limited
stability in the bloodstream. Therefore, replacing proteins, including insulin
or IGF-1, with small molecular weight drugs has received much attention.
However, to date, none of these efforts has resulted in finding an effective
drug replacement.

Peptides mimicking functions of protein hormones have been
previously reported. Yanofsky et al. (1996, Proc. Natl. Acad. Sci. USA
93:7381-7386) reported the isolation of a monomer antagonistic to IL-1 with
nanomolar affinity for the IL-1 receptor. This effort required construction and
use of many phage displayed peptide libraries and sophisticated
phage-panning procedures.

Wrighton et al. (1996, Science 273:458-464) and Livnah et al. (1996,
Science 273:464-471) reported dimer peptides that bind to the erythropoietin
(EPO) receptor with full agonistic activity in vivo. These peptides are
cyclical and have intra-peptide disulfide bonds; like the IL-1 receptor
antagonist, they show no significant sequence identity to the natural ligand.
Importantly, X-ray crystallography revealed that it was the spontaneous
formation of non-covalent peptide homodimers that enabled the
dimerization of two EPO receptors.

WO 96/04557 reported the identification of peptides and antibodies
that bound to active sites of biological targets, which were subsequently
used in competition assays to identify small molecules that acted as agonists
or antagonists at the biological targets. Renchler et al. (1994, Proc. Natl.
Acad. Sci. USA 91:3623-3627) reported synthetic peptide ligands of the
antigen binding receptor that induced programmed cell death in human
B-cell lymphoma.

Most recently, Cwirla et al. (1997, Science 276:1696-1698) reported
the identification of two families of peptides that bound to the human
thrombopoietin (TPO) receptor and were competed by the binding of the
natural TPO ligand. The peptide with the highest affinity, when dimerized by
chemical means, proved to be as potent an in vivo agonist as TPO, the
natural ligand.

III. SUMMARY OF THE INVENTION

This invention relates to the identification of amino acid sequences
that specifically recognize sites involved in IR or IGF-1R activation. Specific
amino acid sequences are identified and their agonist or antagonist activity
at IR and/or IGF-1R has been determined. Such sequences may be
developed as potential therapeutics or as lead compounds to develop other
more efficacious ones. In addition, these sequences may be used in
high-throughput screens to identify and provide information on small molecules
that bind at these sites and mimic or antagonize the functions of insulin or
IGF-1. Furthermore, the peptide sequences provided by this invention can
be used to design secondary peptide libraries, which can be used to identify
sequence variants that increase or modulate the binding and/or activity of
the original peptide at IR or IGF-1R.

In one aspect of this invention, large numbers of peptides have been
screened for their IR and IGF-1R binding and activity characteristics.
Analysis of their amino acid sequences has identified certain consensus
sequences which may be used themselves or as core sequences in larger
amino acid sequences conferring upon them agonist or antagonist activity.
Several generic amino acid sequences are disclosed which bind IR and/or
IGF-1R with varying degrees of agonist or antagonist activity depending on
the specific sequence of the various peptides identified within each motif
group. Also provided are amino or carboxyl terminal extensions capable of
modifying the affinity and/or pharmacological activity of the consensus
sequences when part of a larger amino acid sequence.

The amino acid sequences of this invention which bind IR and/or
IGF-1R include:

a. X1 X2 X3 X4 X5 wherein X1, X2, X4 and X5 are aromatic amino
acids, and X3 is any polar amino acid (Formula 1; Group 1; A6 motif);

b. X6 X7 X8 X9 X10 X11 X12 X13 wherein X6 and X7 are aromatic
amino acids, X8, X9, X11 and X12 are any amino acid, and X10 and X13 are
hydrophobic amino acids (Formula 2; Group 3; B6 motif);

c. X14 X15 X16 X17 X18 X19 X20 X21 wherein X14 and X17 are
hydrophobic amino acids, X15, X16, X18 and X19 are any amino acid, and X20
and X21 are aromatic amino acids (Formula 3; reverse B6; revB6).

d. X22 X23 X24 X25 X26 X27 X28 X29 X30 X31 X32 X33 X34 X35 X36 X37 X38
X39 X40 X41 wherein X22, X25, X28, X29, X30, X33, X34, X35, X36, X37, X38, X40, and
X41 are any amino acid, X35 and X37 may be any amino acid for binding to
IR, whereas X35 is preferably a hydrophobic amino acid and X37 is preferably
glycine for binding to IGF-1R and possess agonist or antagonist activity. X23
and X26 are hydrophobic amino acids. This sequence further comprises at
least two cysteine residues, preferably at X25 and X40. X31 and X32 are small
amino acids (Formula 4; Group 7; E8 motif).

e. X42 X43 X44 X45 X46 X47 X48 X49 X50 X51 X52 X53 X54 X55 X56 X57 X58
X59 X60 X61 wherein X42, X43, X44, X45, X53, X55, X56, X58, X60 and X61 may be
any amino acid, X43, X46, X49, X50, X54 are hydrophobic amino acids, X47 and
X59 are preferably cysteines, X48 is a polar amino acid, and X51, X52 and X57
are small amino acids (Formula 5; mini F8 motif).

f. X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78
X79 X80 X81 wherein X62, X65, X68, X69, X71, X73, X76, X77, X78, X80, and X81 may
be any amino acid; X63, X70, X74 are hydrophobic amino acids; X64 is a polar
amino acid, X67 and X75 are aromatic amino acids and X72 and X79 are
preferably cysteines capable of forming a loop (Formula 6; Group 2; D8
motif).

g. H X82 X83 X84 X85 X86 X87 X88 X89 X90 X91 X92 wherein X82 is
proline or alanine, X83 is a small amino acid, X84 is selected from leucine,
serine or threonine, X85 is a polar amino acid, X86, X88, X89 and X90 are any
amino acid, and X87, X91 and X92 are an aliphatic amino acid (Formula 7).

h. X104 X105 X106 X107 X108 X109 X110 X111 X112 X113 X114 wherein at
least one of the amino acids of X106 through X111, and preferably two, are
tryptophan separated by three amino acids, and wherein at least one of X104,
X105 and X106 and at least one of X112, X113 and X114 are cysteine (Formula
8); and

i. an amino acid sequence comprising the sequence JBA5:
DYKDLCQSWGVRIGWLAGLCPKK (SEQ ID NO:1541) or JBA5 minus
FLAG® tag and terminal lysines: LCQSWGVRIGWLAGLCP (SEQ ID
NO:1542) (Formula 9).

j. W X123 G Y X124 W X125 X126 (SEQ ID NO:1543) wherein X123 is
selected from proline, glycine, serine, arginine, alanine or leucine, but more
preferably proline; X124 is any amino acid, but preferably a charged or
aromatic amino acid; X125 is a hydrophobic amino acid, preferably leucine or
phenylalanine, and most preferably leucine. X126 is any amino acid, but
preferably a small amino acid (Formula 10; Group 6 motif).
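
For illustration only, the simpler consensus formulas listed above can be written as regular expressions and used to scan one-letter sequences. The following Python sketch is not part of the disclosed methods. The residue classes it uses (aromatic = F, W, Y; polar = D, E, H, K, N, Q, R, S, T; hydrophobic = A, F, I, L, M, V, W, Y) are assumed groupings chosen for the example, the function names are hypothetical, and the Formula 10 test string in the usage note is an invented sequence.

```python
import re

# Assumed residue classes for this example only; they are not taken from the text.
AROMATIC = "FWY"
POLAR = "DEHKNQRST"
HYDROPHOBIC = "AFILMVWY"

# Formula 1 (A6 motif): X1 X2 X3 X4 X5, all aromatic except a polar X3.
FORMULA_1 = re.compile(f"[{AROMATIC}]{{2}}[{POLAR}][{AROMATIC}]{{2}}")

# Formula 10 (Group 6 motif): W X123 G Y X124 W X125 X126, with X123 drawn
# from P, G, S, R, A or L and X125 hydrophobic; X124 and X126 are unrestricted.
FORMULA_10 = re.compile(f"W[PGSRAL]GY.W[{HYDROPHOBIC}].")


def find_motif(pattern, sequence):
    """Return the first segment of `sequence` matching `pattern`, or None."""
    match = pattern.search(sequence)
    return match.group(0) if match else None


def strip_jba5_tags(sequence, flag="DYKD"):
    """Remove an N-terminal short FLAG tag and any C-terminal lysines."""
    if sequence.startswith(flag):
        sequence = sequence[len(flag):]
    return sequence.rstrip("K")


if __name__ == "__main__":
    print(find_motif(FORMULA_1, "NFYDWFV"))
    print(strip_jba5_tags("DYKDLCQSWGVRIGWLAGLCPKK"))
```

Applied to the A6S core NFYDWFV, the Formula 1 pattern picks out FYDWF, and stripping the tag and terminal lysines from SEQ ID NO:1541 yields the sequence given as SEQ ID NO:1542. Preferences such as "preferably proline" are not encoded; a scoring scheme rather than a yes/no match would be needed for that.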

In one embodiment, peptides comprising a preferred amino acid 
sequence FYX3WF (SEQ ID NO: 1544) (Formula 1; Group 1; A6 motif) have 
been identified which competitively bind to sites on IR. Surprisingly,
peptides comprising an amino acid sequence FYX3WF (SEQ ID NO:1544)
can possess agonist or antagonist activity at IR.

This invention also identifies at least two distinct binding sites on IR 
based on the differing ability of certain of the peptides to compete with one 
another and insulin for binding to IR. Accordingly, this invention provides
amino acid sequences that bind specifically to one or both sites of IR.
Furthermore, specific amino acid sequences are provided which have either
agonist or antagonist characteristics based on their ability to bind to the
specific sites of IR.

In another embodiment of this invention, amino acid sequences which 
bind to one or more sites of IR or IGF-1R (e.g., Site 1 or Site 2) are
covalently linked together to form multivalent ligands. These multivalent
ligands are capable of forming complexes with a plurality of IR or IGF-1R. 
Either the same or different amino acid sequences are covalently bound 
together to form homo- or heterocomplexes. 

In various aspects of the invention, monomer subunits are covalently
linked at their N-termini or C-termini to form N-N, C-C, N-C, or C-N linked
dimer peptides. In one example, dimer peptides are used to form receptor
complexes bound through the same corresponding sites, e.g., Site 1-Site 1
or Site 2-Site 2 dimers. Alternatively, heterodimer peptides are used to bind
to different sites on one receptor or to cause receptor complexing through
different sites, e.g., Site 1-Site 2 or Site 2-Site 1 dimers. In one novel aspect
of the invention, Site 2-Site 1 dimers find use as insulin agonists, while
certain Site 1-Site 2 dimers find use as insulin antagonists.

In various embodiments, insulin agonists comprise Site 1-Site 1 dimer 
peptide sequences S325, S332, S333, S335, S337, S353, S374-S376, 
S378, S379, S381, S414, S415, and S418; whereas other insulin agonists
comprise Site 2-Site 1 dimer peptide sequences S455, S457, S458, S467,
S468, S471, S499, S510, S518, S519, and S520, as described herein 
below. In one preferred embodiment, an insulin agonist comprises the 
sequence of the S519 dimer peptide, which shows insulin-like activity in both 
in vitro and in vivo assays. 

The present invention also provides assays for identifying compounds
that mimic the binding characteristics of insulin or IGF-1. Such compounds
may act as antagonists or agonists of insulin or IGF-1 function in cell based
assays.

This invention further provides kits for identifying compounds that
bind to IR and/or IGF-1R. Also provided are therapeutic compounds that
bind the insulin receptor or the IGF-1 receptor.

Other embodiments of this invention are the nucleic acid sequences 
encoding the amino acid sequences of the invention. Also within the scope 
of this invention are vectors containing the nucleic acids and host cells
which express the nucleic acids encoding the amino acid sequences which
bind at IR and/or IGF-1R and possess agonist or antagonist activity.

This invention also provides amino acid sequences that bind to active
sites of IR and/or IGF-1R and identifies structural criteria for conferring
agonist or antagonist activity at IR or IGF-1R.

This invention further provides specific amino acid sequences that
possess agonist, partial agonist, or antagonist activity at either IR or IGF-1R.
Such amino acid sequences are potentially useful as therapeutics
themselves or may be used to identify other molecules, especially small
organic molecules, which possess agonist or antagonist activity at IR or
IGF-1R.

In addition, the present invention provides structural information
derived from the amino acid sequences of this invention, which may be used
to construct other molecules possessing the desired activity at the relevant
IR binding site.

IV. BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A-1O; 2A-2E; 3A-3E; 4A-4I; 43A-43B; 44A-44B: Amino acid
sequences identified by panning peptide libraries against IGF-1R and/or IR.
The amino acids are represented by their one-letter abbreviation. The ratios
over background are determined by dividing the signal at 405 nm (E-Tag,
IGF-1R, or IR) by the signal at 405 nm for non-fat milk. The IGF-1R/IR Ratio
Comparison is determined by dividing the ratio of IGF-1R by the ratio of IR.
The IR/IGF-1R Ratio Comparison is determined by dividing the ratio of IR by
the ratio of IGF-1R. HIT indicates binder; CAND indicates binder candidate;
LDH indicates binding to lactate dehydrogenase (negative control); Sp/Irr
indicates the ratio of specific binding over non-specific binding.
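
The ratio calculations described above amount to two divisions, sketched here in Python purely as an illustration. The function names and the absorbance readings are hypothetical and do not come from the figures; the guard against a non-positive background reading is an added assumption.

```python
def ratio_over_background(signal_405, milk_405):
    """Signal at 405 nm for a target divided by the signal for non-fat milk."""
    if milk_405 <= 0:
        raise ValueError("background signal must be positive")
    return signal_405 / milk_405


def ratio_comparison(numerator_ratio, denominator_ratio):
    """Divide one ratio over background by another, e.g. IGF-1R by IR."""
    if denominator_ratio <= 0:
        raise ValueError("denominator ratio must be positive")
    return numerator_ratio / denominator_ratio


# Hypothetical absorbance readings for a single clone.
readings = {"IR": 1.5, "IGF-1R": 0.375, "milk": 0.125}

ir_ratio = ratio_over_background(readings["IR"], readings["milk"])
igf1r_ratio = ratio_over_background(readings["IGF-1R"], readings["milk"])

# IR/IGF-1R Ratio Comparison and its reciprocal, the IGF-1R/IR Ratio Comparison.
ir_vs_igf1r = ratio_comparison(ir_ratio, igf1r_ratio)
igf1r_vs_ir = ratio_comparison(igf1r_ratio, ir_ratio)
```

With these invented readings the clone gives a 12-fold signal over background on IR and a 3-fold signal on IGF-1R, so the IR/IGF-1R Ratio Comparison is 4.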

The design of each library is shown in the first line in bold. In the
design, symbol 'X' indicates a random position, an underlined amino acid
indicates a doped position at the nucleotide level, and other positions are
held constant. Additional abbreviations in the B6H library are: 'O' indicates
an NGY codon where Y is C or T; 'J' indicates an RHR codon where R is A
or G, and H is A, C, or T; and 'U' indicates a VVY codon where V is A, C, or
G, and Y is C or T. The 'h' in the 20E2 libraries indicates an NTN codon.

Symbols in the listed sequences are: Q - TAG Stop; # - TAA Stop; * -
TGA Stop; and ? - Unknown Amino Acid. It is believed that a W replaces
the TGA Stop Codon when expressed. Except for the 20C and A6L
libraries, all libraries are designed with the short FLAG® Epitope DYKD
(SEQ ID NO:1545; Hopp et al., 1988, Bio/Technology 6:1205-1210) at the
N-terminus of the listed sequence and AAAGAP (SEQ ID NO:1546) at the
C-terminus. The 20C and A6L libraries have the full length FLAG® epitope
DYKDDDDK (SEQ ID NO:1547).
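
As a small illustration of the conventions just described, the Python sketch below rebuilds the presumed expressed peptide from a listed sequence for libraries carrying the short FLAG® epitope. The function name is hypothetical. The sketch covers only the short-epitope case and only the '*' symbol, because the text does not state how the other stop symbols are read or whether the 20C and A6L libraries carry the C-terminal AAAGAP.

```python
SHORT_FLAG = "DYKD"        # N-terminal short FLAG epitope (SEQ ID NO:1545)
C_TERMINAL_TAG = "AAAGAP"  # C-terminal sequence (SEQ ID NO:1546)


def presumed_expressed_peptide(listed_sequence):
    """Rebuild the presumed expressed peptide for a short-FLAG library clone.

    '*' (TGA stop) is replaced by W, as the text states is believed to occur.
    Sequences containing '#' or '?' are rejected because the text gives no
    rule for how those positions are expressed.
    """
    if "#" in listed_sequence or "?" in listed_sequence:
        raise ValueError("sequence contains a symbol with no stated reading")
    core = listed_sequence.replace("*", "W")
    return SHORT_FLAG + core + C_TERMINAL_TAG


if __name__ == "__main__":
    print(presumed_expressed_peptide("NFYDWFV"))
```

The 20C and A6L libraries would need the full-length epitope DYKDDDDK in place of DYKD, and are deliberately left out of this sketch.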

Figure 1A: Formula 1 motif peptide sequences obtained from a
random 40mer library panned against IR (SEQ ID NOS:1-3).

Figure 1B: Formula 1 motif peptide sequences obtained from a
random 40mer library panned against IGF-1R (SEQ ID NOS:4-6).

Figure 1C: Formula 1 motif peptide sequences obtained from a
random 20mer library panned against IR (SEQ ID NOS:7-29).

Figure 1D: Formula 1 motif peptide sequences obtained from a
random 20mer library panned against IGF-1R (SEQ ID NOS:30-33).

Figure 1E: Formula 1 motif peptide sequences obtained from a
21mer library constructed to contain X1-10NFYDWFVX18-21 (SEQ ID NO:34;
also referred to as "A6S") panned against IR (SEQ ID NOS:35-98).

Figure 1F: Formula 1 motif peptide sequences obtained from a
21mer library constructed to contain X1-10NFYDWFVX18-21 (SEQ ID NO:34;
also referred to as "A6S") panned against IGF-1R (SEQ ID NOS:99-166).

Figure 1G: Formula 1 motif peptide sequences obtained from a
library constructed to contain variations outside the consensus core of the
A6 peptide as indicated (referred to as "A6L" (SEQ ID NO:167)) panned
against IR (SEQ ID NOS:168-216).

Figure 1H: Formula 1 motif peptide sequences obtained from a
library constructed to contain variations outside the consensus core of the
A6 peptide as indicated (referred to as "A6L" (SEQ ID NO:167)) panned
against IGF-1R (SEQ ID NOS:217-244).

Figure 1I: Formula 1 motif peptide sequences obtained from a
library constructed to contain variations in the consensus core of the E4D
peptide (SEQ ID NO:245) (as indicated) panned against IR (SEQ ID
NOS:246-305).

Figure 1J: Formula 1 motif peptide sequences obtained from a
library constructed to contain variations in the consensus core of the E4D
peptide (SEQ ID NO:245) (as indicated) panned against IGF-1R (SEQ ID
NOS:306-342).

Figure 1K: Formula 1 motif peptide sequences obtained from a
library constructed using the sequence X1-6FHENFYDWFVRQVSX21-26
(SEQ ID NO:343; H2C-A) panned against IR (SEQ ID NOS:344-430).

Figure 1L: Formula 1 motif peptide sequences obtained from a
library constructed using the sequence X1-6FHENFYDWFVRQVSX21-26
(SEQ ID NO:343; H2C-A) panned against IGF-1R (SEQ ID NOS:431-467).

Figure 1M: Formula 1 motif peptide sequences obtained from a
library constructed using the sequence X1-6FHXXFYXWFX16-21 (SEQ ID
NO:468; H2C-B) and panned against IR (SEQ ID NOS:469-575).

Figure 1N: Formula 1 motif peptide sequences obtained from a
library constructed using the sequence X1-6FHXXFYXWFX16-21 (SEQ ID
NO:468; H2C-B) and panned against IGF-1R (SEQ ID NOS:576-657).

Figure 1O: Formula 1 motif peptide sequences obtained from other
libraries panned against IR (SEQ ID NOS:658-712).

Figure 2A: Formula 4 motif peptide sequences identified from a
random 20mer library panned against IR (SEQ ID NO:713).

Figure 2B: Formula 4 motif peptide sequences identified from a
library constructed to contain variations in the F8 peptide (SEQ ID NO:713)
as indicated (15% dope; referred to as "F815") panned against IR (SEQ ID
NOS:714-796).

Figure 2C: Formula 4 motif peptide sequences identified from a
library constructed to contain variations in the F8 peptide (SEQ ID NO:713)
as indicated (15% dope; referred to as "F815") panned against IGF-1R
(SEQ ID NOS:797-811).

Figure 2D: Formula 4 motif peptide sequences identified from a
library constructed to contain variations in the F8 peptide (SEQ ID NO:713)
as indicated (20% dope; referred to as "F820") panned against IR
(SEQ ID NOS:812-861).

Figure 2E: Formula 4 motif peptide sequences identified from other 
libraries panned against IR (SEQ ID NOS:862-925). 
5 Figure 3A: Formula 6 motif peptide sequences identified from a 

random 20mer library and panned against IR (SEQ ID NOS:926-928). 

Figure 3B: Formula 6 motif peptide sequences identified from a
library constructed to contain variations in the D8 peptide (SEQ ID NO:929)
as indicated (15% dope; referred to as "D815") panned against IR (SEQ ID
NOS:930-967).

Figure 3C: Formula 6 motif peptide sequences identified from a
library constructed to contain variations in the D8 peptide (SEQ ID NO:929)
as indicated (20% dope; referred to as "D820") panned against IR (SEQ ID
NOS:968-1010).

Figure 3D: Formula 6 motif peptide sequences identified from a
library constructed to contain variations in the D8 peptide (SEQ ID NO:929)
as indicated (20% dope; referred to as "D820") panned against IGF-1R
(SEQ ID NOS:1011-1059).

Figure 3E: Formula 6 motif peptide sequences identified from other
libraries panned against IR (SEQ ID NOS:1060-1061).

Figure 4A: Formula 10 motif peptide sequences identified from
random 20mer libraries panned against IGF-1R (SEQ ID NOS:1062-1077).

Figure 4B: Formula 10 motif peptide sequences identified from
random 20mer libraries panned against IR (SEQ ID NOS:1078-1082).

Figure 4C: Miscellaneous peptide sequences identified from a
random 20mer library panned against IR (SEQ ID NOS:1083-1086).

Figure 4D: Miscellaneous peptide sequences identified from a
random 40mer library panned against IR (SEQ ID NOS:1087-1088).

Figure 4E: Miscellaneous peptide sequences identified from a
random 20mer library panned against IGF-1R (SEQ ID NOS:1089-1092).

Figure 4F: Miscellaneous peptide sequences identified from an
C Xe-20 library and panned against IGF-1R (SEQ ID NOS:1093-1113).

Figure 4G: Miscellaneous peptide sequences identified from a
library constructed to contain variations of the F8 peptide (SEQ ID NO:1114)
as indicated (F815) panned against IGF-1R (SEQ ID NOS:1115-1118).

Figure 4H: Miscellaneous peptide sequences identified from a
library constructed to contain variations in the F8A11 peptide (SEQ ID
NO:1119) as indicated (referred to as "NNKH") panned against IR (SEQ ID
NOS:1120-1142).

Figure 4I: Miscellaneous peptide sequences identified from a
library constructed to contain variations in the F8A11 peptide (SEQ ID
NO:1119) as indicated (referred to as "NNKH") panned against IGF-1R (SEQ ID
NOS:1143-1154).

Figure 5A: Summary of specific representative amino acid
sequences from Formulas 1, 4, 6, and 10 (SEQ ID NOS:1155-1180).

Figure 5B: Summary of specific representative amino acid
sequences from Formulas 1, 4, 6, and 10 (SEQ ID NOS:1181-1220).

Figure 6: Illustration of 2 binding site domains on IR based on 
competition data. 

Figure 7: Schematic illustration of potential binding schemes to
the multiple binding sites on IR.

Figure 8: Biopanning results and sequence alignments of Group
1 of IR-binding peptides (SEQ ID NOS:1221-1243). The number of
sequences found is indicated on the right side of the figure together with
data on the phage binding to either IR or IGF-1R receptor. Absorbance
signals are indicated by: ++++, >30X over background; +++, 15-30X; ++,
5-15X; +, 2-5X; and 0, <2X.

Figures 9A-9B: Biopanning results and sequence alignments of
Groups 2, 6, and 7 of IR-binding peptides (SEQ ID NOS:1244-1261). The
number of sequences found is indicated on the right side of the figure
together with data on the phage binding to either IR or IGF-1R receptor.
Absorbance signals are indicated by: ++++, >30X over background; +++,
15-30X; ++, 5-15X; +, 2-5X; and 0, <2X.
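The absorbance key used for Figures 8 and 9A-9B can be read as a simple
binning rule. The following is a minimal sketch only; the function name is
hypothetical, and the assignment of the exact boundary values (2X, 5X, 15X,
and 30X) to a bin is an assumption, since the key does not state which bin
they belong to.

```python
def absorbance_symbol(fold_over_background: float) -> str:
    """Map an absorbance signal (fold over background) to the figure key.

    Assumption: boundary values are assigned to the higher bin, except
    that exactly 30X is treated as "+++" because "++++" is given as >30X.
    """
    if fold_over_background > 30:
        return "++++"
    if fold_over_background >= 15:
        return "+++"
    if fold_over_background >= 5:
        return "++"
    if fold_over_background >= 2:
        return "+"
    return "0"


if __name__ == "__main__":
    for signal in (1.2, 3.0, 8.5, 22.0, 45.0):
        print(signal, absorbance_symbol(signal))
```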

Figures 10A-10C: Insulin competition data determined for various
monomer and dimer peptides. Figure 10A shows the competition curve.
Figure 10B shows the symbol key for the peptides. Figure 10C shows the
description of the peptides.

Figures 11A-11D: Insulin competition data determined for various
monomer and dimer peptides. Figure 11A shows the competition curve.
Figure 11B shows the symbol key for the peptides. Figure 11C shows the
description of the peptides. Figure 11D shows IR binding affinity for the
peptides.

Figures 12A-12D: Results of free fat cell assays for truncated
synthetic RP9 monomer peptides, S390 and S394. Figure 12A shows the
results for peptide S390. Figure 12B shows the results for peptide S394.
Figure 12C shows the amino acid sequence of peptides S390 and S394
(SEQ ID NOS:1794 and 1788, respectively in order of appearance). Figure
12D shows the results for full-length RP9 peptide.

Figures 13A-13C: Results of free fat cell assays for truncated
synthetic RP9 dimer peptides, S415 and S417. Figure 13A shows the
results for peptide S415. Figure 13B shows the results for peptide S417.
Figure 13C shows the amino acid sequence of peptides S415 and S417
(SEQ ID NOS:1795-1796).

Figures 14A-14C: Results of free fat cell assays for RP9
homodimer peptides, 521 and 535. Figure 14A shows the results for
peptide 521. Figure 14B shows the results for peptide 535. Figure 14C
shows the amino acid sequence of peptides 521 and 535.

Figures 15A-15C: Results of free fat cell assays for RP9-D8
heterodimer peptides, 537 and 538. Figure 15A shows the results for
peptide 537. Figure 15B shows the results for peptide 538. Figure 15C
shows the amino acid sequence of peptides 537 and 538.

Figures 16A-16C: Results of free fat cell assays for RP9-D8
heterodimer peptides 537 and 538. Figure 16A shows the results for
peptide 537. Figure 16B shows the results for peptide 538. Figure 16C
shows the amino acid sequence of peptides 537 and 538.

Figures 17A-17B: Results of free fat cell assays for D8-RP9
heterodimer peptide, 539. Figure 17A shows the results for peptide 539.
Figure 17B shows the amino acid sequence of peptide 539.

Figures 18A-18D: Results of free fat cell assays for Site 1/Site 2
dimer peptides with constituent monomer peptides with Site 1-Site 2, C-N
(Figure 18A); Site 1-Site 2, N-N (Figure 18B); Site 1-Site 2, C-C (Figure
18C); and Site 2-Site 1, C-N (Figure 18D) orientations and linkages,
respectively.

Figures 19A-19B: Results of human insulin receptor kinase assays
for various monomer and dimer peptides. Figure 19A shows the substrate
phosphorylation curve. Figure 19B shows the EC50 values.

Figures 20A-20B: Results of human insulin receptor kinase assays
for Site 1-Site 2 and Site 2-Site 1 dimer peptides. Figure 20A shows the
substrate phosphorylation curve. Figure 20B shows the EC50 values.

Figures 21A-21B: Results of human insulin receptor kinase assays
for Site 1-Site 2 and Site 2-Site 1 peptides. Figure 21A shows the substrate
phosphorylation curve. Figure 21B shows the EC50 values.

Figures 22A-22B: Results of time-resolved fluorescence resonance
energy transfer assays for assessing the ability of various monomer and
dimer peptides to compete with biotinylated RP9 monomer peptide for
binding to soluble human insulin receptor-immunoglobulin heavy chain
chimera. Figure 22A shows the binding curve. Figure 22B shows the symbol
key and description of the peptide sequences (SEQ ID NOS:2117, 1916-1917,
1558, 1994, 1960-1961, 2008, 1794, 2015-2016, 1560, and 2001-2002,
respectively in order of appearance).

Figures 23A-23C: Results of time-resolved fluorescence resonance
energy transfer assays indicating the ability of various monomer and dimer
peptides to compete with biotinylated S175 monomer peptide or biotinylated
RP9 monomer peptide for binding to soluble human insulin
receptor-immunoglobulin heavy chain chimera. Figures 23A-23B show the
binding curves. Figure 23C shows the symbol key and description of the
peptide sequences (SEQ ID NOS:2117, 1916-1917, 1558, 1994, 1960-1961, 2008,
1794, 2015-2016, 1560, and 2001-2002, respectively in order of
appearance).

Figures 24A-24B: Results of fluorescence polarization assays
indicating the ability of various monomer and dimer peptides to compete
with fluorescein labeled RP9 monomer peptide for binding to soluble human
insulin receptor ectodomain. Figure 24A shows the binding curve. Figure
24B shows the symbol key and description of the peptide sequences (SEQ
ID NOS:2117, 1916-1917, 1558, 1994, 1960-1961, 2008, 1794, 2015-2016,
1560, and 2001-2002, respectively in order of appearance).

Figures 25A-25B: Results of fluorescence polarization assays
indicating the ability of various monomer and dimer peptides to compete
with fluorescein labeled RP9 monomer peptide for binding to soluble human
insulin mini-receptor. Figure 25A shows the binding curve. Figure 25B
shows the symbol key and description of the peptide sequences (SEQ ID
NOS:2117, 1916-1917, 1558, 1994, 1960-1961, 2008, 1794, 2015-2016,
1560, and 2001-2002, respectively in order of appearance).

Figures 26A-26B: Results of fluorescence polarization assays
indicating the ability of various monomer and dimer peptides to compete
with fluorescein labeled insulin for binding to soluble human insulin
receptor ectodomain. Figure 26A shows the binding curve. Figure 26B shows
the symbol key and description of the peptide sequences (SEQ ID NOS:2117,
1916-1917, 1558, 1994, 1960-1961, 2008, 1794, 2015-2016, 1560, and
2001-2002, respectively in order of appearance).

Figures 27A-27B: Results of fluorescence polarization assays
indicating the ability of various monomer and dimer peptides to compete
with fluorescein labeled insulin for binding to soluble human insulin
mini-receptor. Figure 27A shows the binding curve. Figure 27B shows the
symbol key and description of the peptide sequences (SEQ ID NOS:2117,
1916-1917, 1558, 1994, 1960-1961, 2008, 1794, 2015-2016, 1560, and
2001-2002, respectively in order of appearance).

Figure 28: A schematic drawing for the construction of protein
fusions of the maltose binding protein.

Figure 29: BIAcore analysis of competition binding between IR and
maltose binding protein fusion peptides H2C-9aa-H2C, H2C, and
H2C-3aa-H2C.

Figure 30: Stimulation of IR autophosphorylation in vivo by maltose
binding protein fusion peptides.

Figures 31A-31C: Results of free fat cell assays for insulin and Site
2-Site 1 peptides, S519 and S520. Figure 31A shows the results for S519.
Figure 31B shows the results for S520. Figure 31C shows the EC50 values.

Figures 32A-32B: Results of human insulin receptor kinase assays
for insulin and Site 2-Site 1 peptides S519 and S520. Figure 32A shows the
substrate phosphorylation curve. Figure 32B shows the calculated Bestfit
values.

Figure 33: Results of in vivo experiments showing the effect of
intravenous administration of Site 2-Site 1 peptide S519 in Wistar rats.

Figures 34A-34E: Results of phage competition studies with IGF-1
surrogates RP9 (Site 1) and D815 (Site 2) peptides. Phage: RP9 (A6-like);
RP6 (B6-like); D8B12 (Site 2); and D815 (Site 2). Peptides: RP9 and D815.
Figures 34A-34B show the competition curves. Figures 34C-34E show the
symbol keys and peptide groups.

Figures 35A-35E: Phage competition studies with Site 2-Site 1
dimer peptides containing 6- or 12-amino acid linkers. Phage: RP9, RP6,
D8B12, and D815. Peptides: D815-6L-RP9 and D815-12L-RP9. Figures
35A-35B show the competition curves. Figures 35C-35E show the symbol
keys and peptide groups.

Figure 36: Results of IGF-1 agonist assay using FDCP-2 cells.
Site 1 peptides RP6, RP9, G33, and Site 2 peptide D815 were tested in the
agonist assay.

Figure 37: Results of IGF-1 antagonist assay using FDCP-2 cells.
Site 1 peptides RP6, RP9, G33, and Site 2 peptide D815 were tested in the
antagonist assay.

Figure 38: Results of IGF-1 agonist assay using FDCP-2 cells.
Site 1 peptides 20E2, S175, and RP9 were tested in the agonist assay.

Figure 39: Results of agonist and antagonist studies with surrogate
monomers and dimers. Monomers: D815 and RP9; Dimers: D815-6aa-RP9
and D815-12aa-RP9.

Figure 40: Results of agonist and antagonist studies with surrogate
monomers and dimers. Monomers: G33 and D815; Dimer: D815-6aa-G33.

Figure 41: Results of agonist and antagonist studies with surrogate
peptides and dimers. Monomers: G33, D815 and RP9; Dimers: D815-6aa-RP9
and D815-12aa-RP9.

Figure 42: IGF-1 standard curve using FDCP-2 cells.

Figures 43A-43B: Peptide monomers identified from G33 and RP6
secondary libraries panned against IGF-1R (SEQ ID NOS:1262-1432).
Figure 43A shows peptides from G33 secondary library; Figure 43B shows
peptides from RP6 secondary library.

Figures 44A-44B: Peptide dimers identified from libraries panned
against IR or IGF-1R (SEQ ID NOS:1433-1540). Figure 44A shows dimer
peptides panned against IR; Figure 44B shows dimer peptides panned
against IGF-1R.

Figure 45: Results of heterogeneous time-resolved fluorometric
assays showing the effect of recombinant peptide surrogate G33 (rG33) on
the binding of biotinylated-recombinant human IGF-1 (b-rhIGF-1) to
recombinant human IGF-1R (rhIGF-1R).

Figure 46: Results of heterogeneous time-resolved fluorometric
assays showing the effect of recombinant peptide surrogate D815 (rD815)
on the binding of biotinylated-recombinant human IGF-1 (b-rhIGF-1) to
recombinant human IGF-1R (rhIGF-1R).

Figure 47: Results of heterogeneous time-resolved fluorometric
assays showing the effect of recombinant peptide surrogate RP9 on the
binding of biotinylated-recombinant human IGF-1 (b-rhIGF-1) to recombinant
human IGF-1R (rhIGF-1R).

Figure 48: Results of heterogeneous time-resolved fluorometric
assay showing the effect of recombinant peptide surrogate D815-6-G33 on
the binding of biotinylated-recombinant human IGF-1 (b-rhIGF-1) to
recombinant human IGF-1R (rhIGF-1R).

Figure 49: Results of heterogeneous time-resolved fluorometric
assays showing the effect of recombinant peptide surrogate D815-6-RP9 on
the binding of biotinylated-recombinant human IGF-1 (b-rhIGF-1) to
recombinant human IGF-1R (rhIGF-1R).

Figure 50: Results of heterogeneous time-resolved fluorometric
assays showing the effect of recombinant peptide surrogate D815-12-RP9
on the binding of biotinylated-recombinant human IGF-1 (b-rhIGF-1) to
recombinant human IGF-1R (rhIGF-1R).

Figure 51: Results of heterogeneous time-resolved fluorometric
assays showing the effect of IGF-1 on the binding of
biotinylated-recombinant human IGF-1 (b-rhIGF-1) to recombinant human
IGF-1R (rhIGF-1R).

Figure 52: Results of time-resolved fluorescence resonance energy
transfer assays showing the effect of Site 1 peptide surrogates, Site 2
peptide surrogates, and rhIGF-1 on the dissociation of biotinylated-20E2
(b-20E2, Site 1) from recombinant human IGF-1R.

Figure 53: Results of time-resolved fluorescence resonance energy
transfer assays showing the effect of various peptide monomers and dimers
on the dissociation of biotinylated-20E2 (b-20E2, Site 1) from recombinant
human IGF-1R.

Figure 54: Results of glucose uptake assays in SGBS cells showing
the potency of peptide S597 relative to human insulin.

Figure 55: Results of glucose-lowering assays in rats showing the
potency of peptides S557 and S597 relative to human insulin.

Figure 56: Results of glucose-lowering assays in fasted Goettingen
minipigs showing the potency of peptide S597 relative to human insulin.

Figure 57: Results of studies of disappearance of I-125-labelled
peptides from site of injection.

V. DETAILED DESCRIPTION OF THE INVENTION 

This invention relates to amino acid sequences comprising motifs that
bind to the insulin receptor (IR) and/or insulin-like growth factor receptor
(IGF-1R). In addition to binding to IR and/or IGF-1R, the amino acid
sequences also possess either agonist, partial agonist or antagonist activity
at IR or IGF-1R. In addition, the amino acid sequences bind to separate
binding sites (Sites 1 or 2) on IR or IGF-1R.

Although capable of binding to IR or IGF-1R at sites which participate
in conferring agonist or antagonist activity, the amino acid sequences are
not based on the native insulin or IGF-1 sequences, nor do they reflect an
obvious homology to any such sequences.

The amino acid sequences of the invention may be peptides,
polypeptides, or proteins. These terms as used herein should not be
considered limiting with respect to the size of the various amino acid
sequences referred to herein and which are encompassed within this
invention. Thus, any amino acid sequence comprising at least one of the IR
or IGF-1R binding motifs disclosed herein, and which binds to IR or IGF-1R,
is within the scope of this invention. In preferred embodiments, the amino
acid sequences confer insulin or IGF-1 agonist or antagonist activity. The
amino acid sequences of the invention are typically artificial, i.e.,
non-naturally occurring, peptides or polypeptides. Amino acid sequences
useful in the invention may be obtained through various means such as
chemical synthesis, phage display, cleavage of proteins or polypeptides into
fragments, or by any means by which amino acid sequences of sufficient
length to possess binding ability may be made or obtained.

The amino acid sequences provided by this invention should have an
affinity for IR sufficient to provide adequate binding for the intended purpose.
Thus, for use as a therapeutic, the peptide, polypeptide, or protein provided
by this invention should have an affinity (Kd) of between about 10^-7 to about
10^-15 M. More preferably the affinity is 10^-6 to about 10^-12 M. Most
preferably, the affinity is 10^-10 to about 10^-12 M. For use as a reagent in a
competitive binding assay to identify other ligands, the amino acid sequence
preferably has affinity for the receptor of between about 10^-5 to about
10^-12 M.

The present invention describes several different binding motifs,
which bind to active sites on IR or IGF-1R. The binding motifs are defined
based on the analysis of several different amino acid sequences and of
the frequency with which particular amino acids or types of amino acids
occur at a particular position of the amino acid sequence, as described in the
related applications of Beasley et al., International Application
PCT/US00/08528, filed March 29, 2000, and Beasley et al., U.S. Application
Serial No. 09/538,038, filed March 29, 2000.

Also included within the scope of this invention are amino acid
sequences containing substitutions, additions, or deletions based on the
teachings disclosed herein and which bind to IR or IGF-1R with the same or
altered affinity. For example, sequence tags (e.g., FLAG® tags) or amino
acids, such as one or more lysines, can be added to the peptide sequences
of the invention (e.g., at the N-terminal or C-terminal ends) as described in
detail herein. Sequence tags can be used for peptide purification or
localization. Lysines can be used to increase peptide solubility or to allow
for biotinylation. Alternatively, amino acid residues located at the carboxy
and amino terminal regions of the consensus motifs described below, which
comprise sequence tags (e.g., FLAG® tags), or which contain amino acid
residues that are not associated with a strong preference for a particular
amino acid, may optionally be deleted, providing for truncated sequences.
Certain amino acids (e.g., C-terminal or N-terminal residues) such as lysine
which promote the stability or biotinylation of the amino acid sequences
may be deleted depending on the use of the sequence, as for example,
expression of the sequence as part of a larger sequence which is soluble, or
linked to a solid support.
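By way of illustration only, the optional removal of terminal lysines can be
sketched as follows. The function name is hypothetical, and the sketch
assumes that only C-terminal lysines are removed, which is consistent with
the "minus terminal lysines" forms of RP9 and D8 listed in this description
(the N-terminal lysine of D8 is retained there).

```python
def strip_c_terminal_lysines(sequence: str) -> str:
    """Return the sequence without its trailing lysine (K) residues.

    Assumption: only C-terminal lysines are removed; an N-terminal
    lysine is kept.
    """
    return sequence.rstrip("K")


if __name__ == "__main__":
    # RP9 and D8 as given in this description (one-letter codes).
    print(strip_c_terminal_lysines("GSLDESFYDWFERQLGKK"))
    print(strip_c_terminal_lysines("KWLDQEWAWVQCEVYGRGCPSKK"))
```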

Peptides that bind to IR or IGF-1R, and methods and kits for
identifying such peptides, have been disclosed by Beasley et al.,
International Application PCT/US00/08528 filed March 29, 2000 and
Beasley et al., U.S. Application Serial No. 09/538,038 filed March 29, 2000,
which are incorporated by reference in their entirety.

A. Consensus Motifs 

The following motifs have been identified as conferring binding
activity to IR and/or IGF-1R:

1. X1X2X3X4X5 (Formula 1; Group 1; the A6 motif) wherein X1, X2,
X4 and X5 are aromatic amino acids, preferably phenylalanine or tyrosine.
Most preferably, X1 and X5 are phenylalanine and X2 is tyrosine. X3 may be
any small polar amino acid, but is preferably selected from aspartic acid,
glutamic acid, glycine, or serine, and is most preferably aspartic acid or
glutamic acid. X4 is preferably tryptophan, tyrosine, or phenylalanine
and most preferably tryptophan. Particularly preferred embodiments of the
A6 motif are FYDWF (SEQ ID NO:1554) and FYEWF (SEQ ID NO:1555).
The A6 motif possesses agonist activity at IGF-1R, but agonist or antagonist
activity at IR depending on the identity of amino acids flanking A6. See
Figure 5A.

Amino acid sequences that comprise the A6 motif and possess
agonist activity at IR include, but are not limited to, D117/H2C:
FHENFYDWFVRQVSKK (SEQ ID NO:1556); D117/H2 minus terminal
lysines: FHENFYDWFVRQVS (SEQ ID NO:1557); RP9:
GSLDESFYDWFERQLGKK (SEQ ID NO:1558); RP9 minus terminal
lysines: GSLDESFYDWFERQLG (SEQ ID NO:1559); and S175:
GRVDWLQRIMANFYDWFVAELG (SEQ ID NO:1560). Preferred RP9
sequences include GLADEDFYEWFERQLR (SEQ ID NO:1561),
GLADELFYEWFDRQLS (SEQ ID NO:1562), GQLDEDFYEWFDRQLS
(SEQ ID NO:1563), GQLDEDFYAWFDRQLS (SEQ ID NO:1564),
GFMDESFYEWFERQLR (SEQ ID NO:1565), GFWDESFYAWFERQLR
(SEQ ID NO:1566), GFMDESFYAWFERQLR (SEQ ID NO:1567), and
GFWDESFYEWFERQLR (SEQ ID NO:1568). Nonlimiting examples of
Group 1 (Formula 1; A6) amino acid sequences are shown in Figures
1A-1O.
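For illustration, the preferred residues of Formula 1 can be expressed as a
pattern and used to locate A6 cores in a sequence. This is a minimal sketch,
not a definition of the motif: the names are hypothetical, and the pattern
is deliberately restricted to the preferred residues stated above (F or Y at
X1, X2 and X5; D, E, G or S at X3; W, Y or F at X4). Because of that
restriction, sequences carrying another residue at X3, such as the alanine
in FYAWF found in some of the preferred RP9 sequences, are not reported.

```python
import re

# Preferred residues per position of Formula 1 (X1 X2 X3 X4 X5).
A6_PREFERRED = re.compile(r"[FY][FY][DEGS][WYF][FY]")


def find_a6_cores(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched core) for each A6-like core."""
    return [(m.start() + 1, m.group()) for m in A6_PREFERRED.finditer(sequence)]


if __name__ == "__main__":
    print(find_a6_cores("GSLDESFYDWFERQLGKK"))      # RP9
    print(find_a6_cores("GRVDWLQRIMANFYDWFVAELG"))  # S175
    print(find_a6_cores("GQLDEDFYAWFDRQLS"))        # alanine at X3
```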

2. X6X7X8X9X10X11X12X13 (Formula 2; Group 3; the B6 motif)
wherein X6 and X7 are aromatic amino acids, preferably phenylalanine or
tyrosine. Most preferably, X6 is phenylalanine and X7 is tyrosine. X8, X9,
X11, and X12 may be any amino acid. X10 and X13 are hydrophobic amino acids,
preferably leucine, isoleucine, phenylalanine, tryptophan or methionine, but
more preferably leucine or isoleucine. X10 is most preferably isoleucine for
binding to IR and leucine for binding to IGF-1R. X13 is most preferably
leucine. Amino acid sequences of Formula 2 may function as an antagonist
at the IGF-1R, or as an agonist at the IR. Preferred consensus sequences
of the Formula 2 motif are F Y X8 X9 L X11 X12 L (SEQ ID NO:1569),
F Y X8 X9 I X11 X12 L (SEQ ID NO:1570), F Y X8 A I X11 X12 L (SEQ ID
NO:1571), and F Y X8 Y F X11 X12 L (SEQ ID NO:1572).

Another Formula 2 motif for use with this invention comprises
F Y X8 Y F X11 X12 L (SEQ ID NO:1573) and is shown as Formula 2A ("NNRP")
below: X115 X116 X117 X118 F Y X8 Y F X11 X12 L X119 X120 X121 X122 (SEQ ID
NO:1574), wherein X115-X118 and X119-X122 may be any amino acid which
allows for binding to IR or IGF-1R. X115 is preferably selected from the group
consisting of tryptophan, glycine, aspartic acid, glutamic acid, and arginine.
Aspartic acid, glutamic acid, glycine, and arginine are more preferred.
Tryptophan is most preferred. The preference for tryptophan is based on its
presence in clones at a frequency three to five fold higher than that
expected over chance for a random substitution, whereas aspartic acid,
glutamic acid and arginine are present about two fold over the frequency
expected for random substitution.

X116 preferably is an amino acid selected from the group consisting of
aspartic acid, histidine, glycine, and asparagine. X117 and X118 are
preferably glycine, aspartic acid, glutamic acid, asparagine, or alanine.
More preferably X117 is glycine, aspartic acid, glutamic acid or asparagine,
whereas X118 is more preferably glycine, aspartic acid, glutamic acid or
alanine. X8 when present in the Formula 2A motif is preferably arginine,
glycine, glutamic acid, or serine. X11 when present in the Formula 2A motif
is preferably glutamic acid, asparagine, glutamine, or tryptophan, but most
preferably glutamic acid. X12 when present in the Formula 2A motif is
preferably aspartic acid, glutamic acid, glycine, lysine or glutamine, but most
preferably aspartic acid. X119 is preferably glutamic acid, glycine, glutamine,
aspartic acid or alanine, but most preferably glutamic acid. X120 is preferably
glutamic acid, aspartic acid, glycine or glutamine, but most preferably
glutamic acid. X121 is preferably tryptophan, tyrosine, glutamic acid,
phenylalanine, histidine, or aspartic acid, but most preferably tryptophan or
tyrosine. X122 is preferably glutamic acid, aspartic acid or glycine, but most
preferably glutamic acid. Preferred amino acid residues are identified based
on their frequency in clones over two fold over that expected for a random
event, whereas the more preferred sequences occur about 3-5 times as
frequently as expected.
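The frequency criterion described above (observed frequency at a position
relative to the frequency expected by chance) can be sketched as follows.
This is a hedged illustration only: the function name, the clone list, and
the uniform expected frequency of 1/20 per amino acid are assumptions made
for the example. The frequency actually expected for a random substitution
depends on how the library was constructed and must be supplied by the user.

```python
from collections import Counter


def fold_over_chance(clones, position, expected):
    """Observed/expected frequency ratio for each residue at one position.

    clones   : list of equal-length sequences (one-letter codes)
    position : 0-based index of the position to analyse
    expected : dict mapping residue -> frequency expected by chance
    """
    counts = Counter(clone[position] for clone in clones)
    total = len(clones)
    return {aa: (n / total) / expected[aa] for aa, n in counts.items()}


if __name__ == "__main__":
    # Hypothetical two-residue clones and a uniform expectation (assumption).
    uniform = {aa: 1 / 20 for aa in "ACDEFGHIKLMNPQRSTVWY"}
    clones = ["WA", "WC", "DA", "WE"]
    for aa, fold in fold_over_chance(clones, 0, uniform).items():
        print(aa, round(fold, 2))
```

Under the criteria stated above, a residue with a ratio above about two
would count as preferred and one with a ratio of about three to five as more
preferred.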

3. X14X15X16X17X18X19X20X21 (Formula 3; reverse B6; revB6),
wherein X14 and X17 are hydrophobic amino acids; X14 and X17 are preferably
leucine, isoleucine, or valine, but most preferably leucine; X15, X16, X18 and
X19 may be any amino acid; X20 is an aromatic amino acid, preferably
tyrosine or histidine, but most preferably tyrosine; and X21 is an aromatic
amino acid, but preferably phenylalanine or tyrosine, and most preferably
phenylalanine. For use as an IGF-1R binding ligand, an aromatic amino
acid is strongly preferred at X18.

4. X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36X37X38X39X40X41
(Formula 4; Group 7; the F8 motif) wherein X22, X25, X26, X28, X29, X30,
X33, X34, X35, X36, X37, X38, X40, and X41 are any amino acid. X35 and X37 may
be any amino acid when the F8 motif is used as an IR binding ligand or as a
component of an IR binding ligand; however, for use as an IGF-1R binding
ligand, glycine is strongly preferred at X37 and a hydrophobic amino acid,
particularly leucine, is preferred at X35. X23 is a hydrophobic amino acid.
Methionine, valine, leucine or isoleucine are preferred amino acids for X23;
however, leucine, which is most preferred for preparation of an IGF-1R
binding ligand, is especially preferred for preparation of an IR binding ligand.
At least one cysteine is located at X24 through X27, and one at X39 or X40.
Together the cysteines are capable of forming a cysteine cross-link to create
a looped amino acid sequence. In addition, although a spacing of 14 amino
acids in between the two cysteine residues is preferred, other spacings may
also be used provided binding to IGF-1R or IR is maintained. Accordingly,
other amino acids may be substituted for the cysteines at positions X24 and
X39 if the cysteines occupy other positions.

In one embodiment, for example, the cysteine at position X24 may
occur at position X27, which will produce a smaller loop provided that the
cysteine is maintained at position X39. These smaller looped peptides are
described herein as Formula 5, infra. X27 is any polar amino acid, but is
preferably selected from glutamic acid, glutamine, aspartic acid, asparagine,
or, as discussed above, cysteine. The presence of glutamic acid at position
X27 decreases binding to IR but has less of an effect on binding to IGF-1R.
X31 is any aromatic amino acid and X32 is any small amino acid. For binding
to IGF-1R, glycine or serine is preferred at position X31; however, tryptophan
is highly preferred for binding to IR. At position X32, glycine is preferred for
both IGF-1R and IR binding. X36 is an aromatic amino acid. A preferred
consensus sequence for F8 is X22 L C X25 X26 E X28 X29 X30 W G X33 X34 X35
X36 X37 X38 C X40 X41 (SEQ ID NO:1575), wherein the amino acids are as
defined above. A more preferred F8 sequence is
HLCVLEELFWGASLFGYCSG ("F8"; SEQ ID NO:1576). Amino acid
sequences comprising the F8 sequence motif preferably bind to IR over
IGF-1R. Figures 2A-2E list nonlimiting examples of Formula 4 amino acid
sequences.
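As a hedged illustration, the preferred F8 consensus (SEQ ID NO:1575) and the
preferred cysteine spacing can be checked against the F8 peptide (SEQ ID
NO:1576). The names below are hypothetical, the pattern encodes only the
fixed residues of the consensus (every X position is treated as any amino
acid), and the spacing helper assumes a sequence with exactly two cysteines.

```python
import re

# X22 L C X25 X26 E X28 X29 X30 W G X33-X38 C X40 X41; "." stands for any X.
F8_CONSENSUS = re.compile(r".LC..E...WG......C..")

F8 = "HLCVLEELFWGASLFGYCSG"  # SEQ ID NO:1576


def residues_between_cysteines(sequence: str) -> int:
    """Count the residues lying between the two cysteines of a sequence."""
    positions = [i for i, aa in enumerate(sequence) if aa == "C"]
    if len(positions) != 2:
        raise ValueError("expected exactly two cysteines")
    return positions[1] - positions[0] - 1


if __name__ == "__main__":
    print(bool(F8_CONSENSUS.fullmatch(F8)))
    print(residues_between_cysteines(F8))
```

For F8 the helper returns 14, in line with the preferred spacing between the
cysteines at X24 and X39.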

5. X42 X43 X44 X45 X46 X47 X48 X49 X50 X51 X52 X53 X54 X55 X56 X57
X58 X59 X60 X61 (Formula 5; mini F8 motif) wherein X42, X43, X44, X45, X53,
X55, X56, X58, X60 and X61 are any amino acid. X43, X46, X49, X50 and X54 are
hydrophobic amino acids; however, X43 and X46 are preferably leucine,
whereas X50 is preferably phenylalanine or tyrosine but most preferably
phenylalanine. X47 and X59 are cysteines. X48 is preferably a polar amino
acid, i.e., aspartic acid or glutamic acid, but most preferably glutamic acid.
Use of a small amino acid at position 54 may confer IGF-1R specificity.
X51, X52, and X57 are small amino acids, preferably glycine. A preferred
consensus sequence for mini F8 is X42 X43 X44 X45 L C E X49 F G G X53 X54
X55 X56 G X58 C X60 X61 (SEQ ID NO:1577). Amino acid sequences
comprising the sequence of Formula 5 preferably bind to IGF-1R or IR.

6. X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77
X78 X79 X80 X81 (Formula 6; Group 2; the D8 motif) wherein X62, X65, X68, X69,
X71, X73, X76, X77, X78, X80 and X81 may be any amino acid. X66 may also be
any amino acid; however, there is a strong preference for glutamic acid.
Substitution of X66 with glutamine or valine may result in attenuation of
binding. X63, X70, and X74 are hydrophobic amino acids. X63 is preferably
leucine, isoleucine, methionine, or valine, but most preferably leucine. X70
and X74 are preferably valine, isoleucine, leucine, or methionine. X74 is most
preferably valine. X64 is a polar amino acid, more preferably aspartic acid or
glutamic acid, and most preferably glutamic acid. X67 and X75 are aromatic
amino acids. Whereas tryptophan is highly preferred at X67, X75 is preferably
tyrosine or tryptophan but most preferably tyrosine. X72 and X79 are
cysteines that again are believed to form a loop, the position of which
may be altered by shifting the cysteines in the amino acid sequence.

D8 is most useful as an amino acid sequence having a preference for
binding to IR, as only a few D8 sequences capable of binding to IGF-1R over
background have been detected. A preferred sequence for binding to IR is
X62 L X64 X65 X66 W X68 X69 X70 X71 C X73 X74 X75 X76 X77 X78 C X80 X81
(SEQ ID NO:1578). Examples of specific peptide sequences comprising this
motif include D8: KWLDQEWAWVQCEVYGRGCPSKK (SEQ ID NO:1579); and
D8 minus terminal lysines: KWLDQEWAWVQCEVYGRGCPS (SEQ ID
NO:1580). Preferred D8 monomer sequences include
SLEEEWAQIQCEIYGRGCRY (SEQ ID NO:1581) and
SLEEEWAQIQCEIWGRGCRY (SEQ ID NO:1582). Preferred D8 dimer
sequences include SLEEEWAQIECEVYGRGCPS (SEQ ID NO:1583) and
SLEEEWAQIECEVWGRGCPS (SEQ ID NO:1584). Nonlimiting examples
of Group 2 (Formula 6; D8) amino acid sequences are shown in Figures
3A-3E.
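
As an illustration only, the preferred IR-binding D8 consensus (L at
position 63, W at position 67, and cysteines at positions 72 and 79, with
all other positions unconstrained) can be written as a regular expression
and used to scan a candidate sequence. The following is a minimal sketch;
the names D8_IR_CONSENSUS and find_d8_motif are hypothetical and are not
part of the disclosure, and a match indicates only that the fixed positions
are present, not that the sequence binds IR.

```python
import re

# Preferred IR-binding D8 consensus, positions X62 through X81.
# "." stands for an unconstrained position; L, W, C, C are the fixed residues.
D8_IR_CONSENSUS = re.compile(r".L...W....C......C..")


def find_d8_motif(sequence):
    """Return the first 20-residue window matching the consensus, or None."""
    match = D8_IR_CONSENSUS.search(sequence.upper())
    return match.group(0) if match else None


if __name__ == "__main__":
    for name, seq in [
        ("D8", "KWLDQEWAWVQCEVYGRGCPSKK"),
        ("D8 monomer", "SLEEEWAQIQCEIYGRGCRY"),
    ]:
        print(name, find_d8_motif(seq))
```

The sketch checks only the four fixed residues; the stated preferences at
the remaining positions (for example, glutamic acid at X66) would need
additional scoring.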

7. H X82 X83 X84 X85 X86 X87 X88 X89 X90 X91 X92 (Formula 7)
wherein X82 is proline or alanine but most preferably proline; X83 is a small
amino acid, more preferably proline, serine or threonine and most preferably
proline; X84 is selected from leucine, serine or threonine but most preferably
leucine; X85 is a polar amino acid, preferably glutamic acid, serine, lysine or
asparagine but more preferably serine; X86 may be any amino acid but is
preferably a polar amino acid such as histidine, glutamic acid, aspartic acid,
or glutamine; X87 is an aliphatic amino acid, preferably leucine, methionine or
isoleucine and most preferably leucine; X88, X89 and X90 may be
any amino acids; X91 is an aliphatic amino acid with a strong preference for
leucine, as is X92. Phenylalanine may also be used at position 92. A
preferred consensus sequence of Formula 7 is H P P L S X86 L X88 X89 X90 L
L (SEQ ID NO:1585). The Formula 7 motif binds to IR with little or no
binding to IGF-1R.

8. Another sequence is X104 X105 X106 X107 X108 X109 X110 X111
X112 X113 X114 (Formula 8), which comprises eleven amino acids wherein at
least one, and preferably two, of the amino acids of X106 through X111 are
tryptophan. In addition, it is also preferred that when two tryptophan amino
acids are present in the sequence they are separated by three amino acids,
which are preferably, in sequential order, proline, threonine and tyrosine, with
proline being adjacent to the tryptophan at the amino terminal end.
Accordingly, the most preferred sequence for X107 X108 X109 X110 X111 is
WPTYW (SEQ ID NO:1586). At least one of the three amino acids on the
amino terminal (X104 X105 X106) and at least one of the amino acids on the
carboxy terminal (X112 X113 X114) ends immediately flanking X107-X111 are
preferably a cysteine residue, most preferably at X105 and X113, respectively.
Without being bound by theory, the cysteines are preferably spaced so as to
allow for the formation of a loop structure. X104 and X114 are both small amino
acids such as, for example, alanine and glycine. Most preferably, X104 is
alanine and X114 is glycine. X106 may be any amino acid but is preferably
valine. X112 is preferably asparagine. Thus, the most preferred sequence is
ACVWPTYWNCG (SEQ ID NO:1587).

9. An amino acid sequence comprising JBA5:
DYKDLCQSWGVRIGWLAGLCPKK (SEQ ID NO:1541); or JBA5 without
terminal lysines: LCQSWGVRIGWLAGLCP (SEQ ID NO:1542) (Formula 9).
The Formula 9 motif is another motif believed to form a cysteine loop that
possesses agonist activity at both IR and IGF-1R. Although IR binding is
not detectable by ELISA, binding of Formula 9 to IR is competed by insulin
and is agonistic.

10. W X123 G Y X124 W X125 X126 (SEQ ID NO:1543) (Formula 10;
Group 6) wherein X123 is selected from proline, glycine, serine, arginine,
alanine or leucine, but more preferably proline; X124 is any amino acid, but
preferably a charged or aromatic amino acid; X125 is a hydrophobic amino
acid, preferably leucine or phenylalanine, and most preferably leucine. X126
is any amino acid, but preferably a small amino acid. In one embodiment of
the present invention, the Formula 10, Group 6 motif is WPGY (SEQ ID
NO:1588). Examples of specific peptide sequences comprising this motif
include E8: KVRGFQGGTVWPGYEWLRNAAKK (SEQ ID NO:1589); and
E8 minus terminal lysines: KVRGFQGGTVWPGYEWLRNAA (SEQ ID
NO:1590). Preferred Group 6 sequences include WAGYEWF (SEQ ID
NO:1591), WEGYEWL (SEQ ID NO:1592), WAGYEWL (SEQ ID NO:1593),
WEGYEWF (SEQ ID NO:1594), and DSDWAGYEWFEEQLD (SEQ ID
NO:1595). Nonlimiting examples of Group 6 amino acid sequences are
shown in Figures 4A-B.

The IR and IGF-1R binding activities of representative Group 1
(Formula 1; A6); Group 2 (Formula 6; D8); Group 6 (Formula 10); and
Group 7 (Formula 4; F8) amino acid sequences are summarized in Figures
8 and 9A-9B. Group 1 (Formula 1; A6) amino acid sequences contain the
consensus sequence FYxWF (SEQ ID NO:1596), which is typically agonistic
in cell-based assays. Group 2 (Formula 6; D8) amino acid sequences are
composed of two internal sequences having a consensus sequence VYGR
(SEQ ID NO:1597) and two cysteine residues each. Thus, Group 2 peptides
are capable of forming a cyclic peptide bridged with a disulfide bond.
Neither of these consensus sequences has any significant linear sequence
similarity greater than 2 or 3 amino acids with mature insulin. Group 7
(Formula 4; F8) amino acid sequences are composed of two internal
exemplary sequences which do not have any significant sequence
homology, but have two cysteine residues 13-14 residues apart, thus being
capable of forming a cyclic peptide with a long loop anchored by a disulfide
bridge.

B. Amino And Carboxyl Terminal Extensions Modulate 
Activity of Motifs 

In addition to the motifs stated above, the invention also provides
preferred sequences at the amino terminal or carboxyl terminal ends which
are capable of enhancing binding of the motifs to either IR, IGF-1R, or both.
In addition, the use of the extensions described below does not preclude the
possible use of the motifs with other substitutions, additions or deletions
that allow for binding to IR, IGF-1R, or both.

1. Formula 1 

Any amino acid sequence may be used for extensions of the amino
terminal end of A6, although certain amino acids in amino terminal
extensions may be identified which modulate activity. Preferred carboxy
terminal extensions for A6 are A6 X93 X94 X95 X96 X97 wherein X93 may be
any amino acid, but is preferably selected from the group consisting of
alanine, valine, aspartic acid, glutamic acid, and arginine, and X94 and X97
are any amino acid; X95 is preferably glutamine, glutamic acid, alanine or
lysine but most preferably glutamine. The presence of glutamic acid at X95,
however, may confer some IR selectivity. Further, the failure to obtain
sequences having an asparagine or aspartic acid at position X95 may
indicate that these amino acids should be avoided to maintain or enhance
sufficient binding to IR and IGF-1R. X96 is preferably a hydrophobic or
aliphatic amino acid, more preferably leucine, isoleucine, valine, or
tryptophan but most preferably leucine. Hydrophobic residues, especially
tryptophan at X96, may be used to enhance IR selectivity.

2. Formula 2 

B6 with amino terminal and carboxy terminal extensions may be
represented as X98 X99 B6 X100. X98 is optionally aspartic acid and X99 is
independently an amino acid selected from the group consisting of glycine,
glutamine, and proline. The presence of an aspartic acid at X98 and a
proline at X99 is associated with an enhancement of binding for both IR and
IGF-1R. A hydrophobic amino acid is preferred at X100; an
aliphatic amino acid is more preferred, most preferably leucine for IR and
valine for IGF-1R. Negatively charged amino acids are preferred at both the
amino and carboxy terminals of Formula 2A.

3. Formula 3

An amino terminal extension of Formula 3 defined as X101 X102 X103
revB6, wherein X103 is a hydrophobic amino acid, preferably leucine,
isoleucine or valine, and X102 and X101 are preferably polar amino acids,
more preferably aspartic acid or glutamic acid, may be useful for enhancing
binding to IR and IGF-1R. No preference is apparent for the amino acids at
the carboxy terminal end of Formula 3.

4. Formula 10 

In one preferred embodiment, Formula 10 sequences W X123 G Y
X124 W X125 X126 (SEQ ID NO:1543) can include an amino terminal extension
comprising the sequence DSD and/or a carboxy terminal extension
comprising the sequence EQLD (SEQ ID NO:1598).

C. IR Binding Preferences 

As indicated above, the amino acid sequences containing the motifs
of this invention may be constructed to have enhanced selectivity for either
IR or IGF-1R by choosing appropriate amino acids at specific positions of
the motifs or the regions flanking them. By providing amino acid
preferences for IR or IGF-1R, this invention provides the means for
constructing amino acid sequences with minimized activity at the
non-cognate receptor. For example, the amino acid sequences disclosed herein
with high affinity and activity for IR and low affinity and activity for IGF-1R
are desirable as IR agonists because their propensity to promote undesirable
cell proliferation, an activity of IGF-1 agonists, is reduced. Ratios of IR
binding affinity to IGF-1R binding affinity for specific sequences are provided
in Figures 1A-10; 2A-2E; 3A-3E; 4A-4I; 44A-44B. As an insulin therapeutic,
the IR/IGF-1R binding affinity ratio is preferably greater than 100.
Conversely, for use as an IGF-1R therapeutic, the IR/IGF-1R ratio should be
less than 0.01. Examples of peptides that selectively bind to IGF-1R are
shown below.

Besides relative binding at IR or IGF-1R, relative efficacy at the
cognate receptor is another important consideration for choosing a potential
therapeutic. Thus, a sequence that is efficacious at IR but has little or no
significant activity at IGF-1R may also be considered as an important IR
therapeutic, irrespective of the relative binding affinities at IR and IGF-1R.
For example, A6 selectivity for IR may be enhanced by including glutamic
acid in a carboxyl terminal extension at position X95. IR selectivity of the B6
motif may be enhanced by having a tryptophan or phenylalanine at X11.
Tryptophan at X13 also favors selectivity of IR. A tryptophan amino acid at
X13 rather than leucine at that position also may be used to enhance
selectivity for IR. In the reverse B6 motif, a large amino acid at X15 favors IR
selectivity. Conversely, small amino acids may confer specificity for IGF-1R.
In the F8 motif, an L in position X23 is essentially required for IR binding. In
addition, tryptophan at X31 is also highly preferred. At X32, glycine is
preferred for IR selectivity.
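
The affinity-ratio thresholds stated earlier in this section (an IR/IGF-1R
ratio greater than 100 for an insulin therapeutic, and less than 0.01 for an
IGF-1R therapeutic) can be expressed as a simple classification. The sketch
below is illustrative only: the function name, the labels, and the
"intermediate" category are assumptions introduced here, and the ratio is
taken as already computed in the sense used by the text (larger values
indicating a preference for IR).

```python
# Thresholds taken from the text; names and labels are hypothetical.
INSULIN_THERAPEUTIC_MIN_RATIO = 100.0
IGF1R_THERAPEUTIC_MAX_RATIO = 0.01


def classify_ratio(ir_to_igf1r_ratio):
    """Classify a peptide by its IR/IGF-1R binding affinity ratio."""
    if ir_to_igf1r_ratio <= 0:
        raise ValueError("ratio must be positive")
    if ir_to_igf1r_ratio > INSULIN_THERAPEUTIC_MIN_RATIO:
        return "IR-selective"
    if ir_to_igf1r_ratio < IGF1R_THERAPEUTIC_MAX_RATIO:
        return "IGF-1R-selective"
    return "intermediate"


if __name__ == "__main__":
    for ratio in (250.0, 1.0, 0.005):
        print(ratio, classify_ratio(ratio))
```

As the text notes, relative efficacy at the cognate receptor is a separate
consideration, so a ratio-based label of this kind would not by itself
identify a therapeutic candidate.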

D. Multiple Binding Sites On IR And IGF-1R

The competition data disclosed herein reveal that at least two
separate binding sites are present on IR and IGF-1R which recognize the
different sequence motifs provided by this invention.

As shown in Figure 6, competition data indicate that peptides
comprising the A6 motifs compete for binding to the same site on IR (Site 1)
whereas the D8 motifs compete for a second site (Site 2). The identification
of peptides that bind to separate binding sites on IR and IGF-1R provides for
various schemes of binding to IR or IGF-1R to increase or decrease its
activity. Examples of such schemes for IR are illustrated in Figure 7.

The table below shows sequences, based on their groups, which bind
to Site 1 or Site 2.

TABLE 2

REPRESENTATIVE SITE 1 PEPTIDES

A6-like (FYxWF) (SEQ ID NO: 1596):

Clone             Sequence                    SEQ ID NO
G3                KRGGGTFYEWFESALRKHGAGKK     1718
H2                VTFTSAVFHENFYDWFVRQVSKK
H2C               FHENFYDWFVRQVSKK            1556
A6S-IR3-E12       GRVDWLQRNANFYDWFVAELG
A6S-IR4-G1        NGVERAGTGDNFYDWFVAQLH       1720
H2CB-R3-B12       QSDSGTVHDRFYGWFRDTWAS       1721
20E2A-R3-B11      GRFYGWFQDAIDQLMPWGFDP       1722
rB6-F6            RYGRWGLAQQFYDWFDR           1723
E4Da-1-B8-IR-     GFREGQRWYWFVAQVT            1724
H2CA-4-F11-IR     TYKARFLHENFYDWFNRQVSQYFGRV  1725
H2CB-R3-D2        WTDVDGFHSGFYRWFQNQWER       1726
H2CB-R3-D12       VASGHVLHGQFYRWFVDQFAL       1727
H2CB-R4-H5        QARVGNVHQQFYEWFREVMQG       1728
H2C-B-E8*         TGHRLGLDEQFYWWFRDALSG       1729
H2CB-3-B6-IR-     VGDFCVSHDCFYGWFLRESMQ       1730
A6S-IR2-C1        RMYFSTGAPQNFYDWFVQEWD       1731

B6-like (FYxxLxxL) (SEQ ID NO: 1732):

Clone             Sequence                    SEQ ID NO
20C11             KDRAFYNGLRDLVGAVYGAWDKK     1733
20E2              DYKDFYDAIDQLVRGSARAGGTRDKK  1734
B62-R3-C7         EHWNTVDPFYFTLFBWLRESG       1735
B62-R3-C10        EHWNTVDPFYQYFSELLRESG       1736
20E2B-3-B3-IR     AGVNAGFYRYFSTLLDWWDQG       1737
20E2-B-E3*        IQGWEPFYGWFDDWAQMFEE        1738
20E2A-R4-F9       PPWGARFYDAIEQLVFDNLCC       1739
RPNN-4-G6-HOLO*   RWPNFYGYFESLLTHFS           1740
RPNN-4-F3-HOLO*   HYNAFYEYFQVLLAETW           1741
20E2A-R4-E2       IGRVRSFYDAIDKLFQSDWER       1742
RPNN-2-C1-IR*     EGWDFYSYFSGLLASVT           1743
20E2B-4-F12-IR    SVKEVQFYRYFYDLLQSEESG       1744
20E2-B-E12        GNSGGSFYRYFQLLLDSDGMS       1745
20E2A-R3-B6       RDAGSSFYDAIDQLVCLTYFC       1746

Reverse B6-like (LxxLxxYF) (SEQ ID NO: 1747):

Clone             Sequence                    SEQ ID NO
rB6-A12           LDALDRLMRYFEERPSL           1748
rB6-F9            PLAELWAYFEHSEQGRSSAH        1749
rB6-4-E7-IR       LDPLDALLQYFWSVPGH           1750
rB6-4-F9-IR       RGRLGSLSTQFYNWFAE           1751
rB6-E6            ADELEWLLDYFMHQPRP           1752
rB6-4-F12-IR      DGVLEELFSYFSATVGP           1753

Group 6 (WPxYxWL) (SEQ ID NO: 1754):

Clone             Sequence                    SEQ ID NO
R2O0-4-A4-IR      WPGYLFFEEALQDWRGSTED        1755

Peptides by design**:

Clone             Sequence                    SEQ ID NO
H2C-PD1-IR-       AAVHEQFYDWFADQYKK           1756
A6S-PD1-IR-       QAPSNFYDWFVREWDKK           1757
20E2-PD1-IR-      QSFYDYIEELLGGEWKK           1758
B6C-PD1-IR-       DPFYQGLWEWLRESGKK           1759

REPRESENTATIVE SITE 2 PEPTIDES (C-C LOOPS)

F8-derived (Long C-C loop):

Clone             Sequence                    SEQ ID NO
F8                HLCVLEELFWGASLFGYCSG        1760
F8-C12            FQSLLEELVWGAPLFRYGTG        1761
F8-Dea2           PLCVLEELFWGASLFGYCSG        1762
F8-F12            PLCVLEELFWGASLFGQCSG        1763
F8-B9             HLCVLEELFWGASLFGQCSG        1764
F8-B12            DLRVLCELFGGAYVLGYCSE        1765
NNKH-2B3          HRSVLKQLSWGASLFGQWAG        1766
NNKH-2F9-         HLSVGEELSWWVALLGQWAR        1767
NNKH-4H4-         APVSTEELRWGALLFGQWAG        1768

D8-derived (Small C-C loop):

Clone             Sequence                    SEQ ID NO
D8                KWLDQEWAWVQCEVYGRGCPSKK     1769
D8-G1             QLEEEWAGVQCEVYGRECPS        1770
D8-B5-            ALEEEWAWVQVRSIRSGLPL        1771
D8-A7             SLDQEWAWVQCEVYGRGCLS        1772
D8-F1-            WLEHEWAQIQCELYGRGCTY        1773

Midi C-C loop:

Clone             Sequence                    SEQ ID NO
D8-F10            GLEQGCPWVGLEVQCRGCPS        1774
F8-B12-           DLRVLCELFGGAYVLGYCSE        1775
F8-A9             PLWGLCELFGGASLFGYCSS        1776

** Based on analysis of entire panning data, amino acid preferences at each
position were calculated to define these "idealized" peptides; * Peptides
synthesized and currently being purified; - Peptides planned.

In various aspects of the present invention, amino acid sequences
comprising Site 1 motifs may bind to Site 1 of IR or Site 1 of IGF-1R.
Similarly, amino acid sequences comprising Site 2 motifs may bind to Site
2 of IR or Site 2 of IGF-1R. However, specific peptides may show higher
binding affinity for IR than for IGF-1R, while other peptides may show higher
binding affinity for IGF-1R than for IR. In addition, Site 1 and Site 2 on IR do
not "crosstalk", i.e., Site 1-binding sequences do not compete with Site
2-binding sequences at IR. In contrast, Site 1 and Site 2 on IGF-1R do show
some crosstalk, suggesting an allosteric effect. These aspects are
illustrated in the Examples described hereinbelow.

E. Multivalent Ligands

This invention provides ligands that preferentially bind different sites
on IR and IGF-1R. The A6 amino acid sequence motif confers binding to IR
at Site 1 (Figure 6). The D8 amino acid sequence motif confers binding to
IR at Site 2 (Figure 6). Accordingly, multimeric ligands may be prepared
according to the invention by covalently linking amino acid sequences.
Depending on the purpose intended for the multivalent ligand, amino acid
sequences that bind the same or different sites may be combined to form a
single molecule. Where the multivalent ligand is constructed to bind to the
same corresponding site on different receptors, or different subunits of a
receptor, the amino acid sequences of the ligand for binding to the receptors
may be the same or different, provided that if different amino acid
sequences are used, they both bind to the same site.

Multivalent ligands may be prepared by either expressing amino acid
sequences which bind to the individual sites separately and then covalently
linking them together, or by expressing the multivalent ligand as a single
amino acid sequence which comprises within it the combination of specific
amino acid sequences for binding.

Various combinations of amino acid sequences may be combined to
produce multivalent ligands having specific desirable properties. Thus,
agonists may be combined with agonists, antagonists combined with
antagonists, and agonists combined with antagonists. Combining amino
acid sequences that bind to the same site to form a multivalent ligand may
be useful to produce molecules that are capable of cross-linking together
multiple receptor units. Multivalent ligands may also be constructed to
combine amino acid sequences which bind to different sites (Figure 7).

In view of the discovery disclosed herein of monomers having agonist
properties at IR or IGF-1R, preparation of multivalent ligands may be useful
to prepare ligands having more desirable pharmacokinetic properties due to
the presence of multiple binding sites on a single molecule. In addition,
combining amino acid sequences that bind to different sites with different
affinities provides a means for modulating the overall potency and affinity of
the ligand for IR or IGF-1R.

1. Construction of Hybrids

In one embodiment, hybrids of at least two peptides (e.g., dimer
peptides) may be produced as recombinant fusion polypeptides, which are
expressed in any suitable expression system. The polypeptides may bind
the receptor as either fusion constructs containing amino acid sequences
besides the ligand binding sequences or as cleaved proteins from which
signal sequences or other sequences unrelated to ligand binding are
removed. Sequences for facilitating purification of the fusion protein may
also be expressed as part of the construct. Such sequences optionally may
be subsequently removed to produce the mature binding ligand.
Recombinant expression also provides means for producing large quantities
of ligand. In addition, recombinant expression may be used to express
different combinations of amino acid sequences and to vary the orientation
of their combination, i.e., amino to carboxyl terminal orientation.

In one embodiment shown below (Figure 28), MBP-FLAG®-
PEPTIDE-(GGS)n (SEQ ID NO:1777)-PEPTIDE-E-TAG, a fusion construct
producing a peptide dimer comprises a maltose binding protein amino acid
sequence (MBP) or similar sequence useful for enabling the affinity
chromatography purification of the expressed peptide sequences. This
purification facilitating sequence may then be attached to a FLAG®
sequence to provide a cleavage site to remove the initial sequence. The
dimer then follows, which includes the intervening linker, and a tag sequence
may be included at the carboxyl terminal portion to facilitate
identification/purification of the expressed peptide. In the representative
construct illustrated above, G and S are glycine and serine residues, which
make up the linker sequence. As non-limiting examples, n can be 1, 2, 3, or
4 to yield a linker sequence of 3, 6, 9, or 12 amino acids, respectively.

In addition to producing the dimer peptides by recombinant protein
expression, dimer peptides may also be produced by peptide synthesis
whereby a synthetic technique such as Merrifield synthesis (Merrifield,
1997) may be used to construct the entire peptide.

Other methods of constructing dimer peptides include introducing a
linker molecule that activates the terminal end of a peptide so that it can
covalently bind to a second peptide. Examples of such linkers include, but
are not limited to, diaminopropionic acid activated with an oxyamino
function. A preferred linker is a dialdehyde having the formula
O=CH-(CH2)n-CH=O, wherein n is 2 to 6, but is preferably 6 to produce a
linker of about 25 to 30 angstroms in length. Other preferred linkers are
shown in Table 3. Linkers may be used, for example, to couple monomers
at either the carboxyl terminal or the amino terminal ends to form dimer
peptides. Also, the chemistry can be inverted, i.e., the peptides to be
coupled can be equipped with aldehyde functions, either by oxidation with
sodium periodate of an N-terminal serine, or by oxidation of any other vicinal
hydroxy- or amino-groups, and the linker can comprise two oxyamino
functions (e.g., at the end of a polyethylene glycol linker) or amino groups
which are coupled by reductive amination.

In specific embodiments, Site 1-Site 2 and Site 2-Site 1 orientations
are possible. In addition, N-terminal to N-terminal (N-N); C-terminal to
C-terminal (C-C); N-terminal to C-terminal (N-C); and C-terminal to N-terminal
(C-N) linkages are possible. Accordingly, peptides may be oriented Site 1 to
Site 2, or Site 2 to Site 1, and may be linked N-terminus to N-terminus,
C-terminus to C-terminus, N-terminus to C-terminus, or C-terminus to
N-terminus. In certain cases, a specific orientation may be preferable to
others, for example, for maximal agonist or antagonist activity.

In an unexpected and surprising result, the orientation and linkage of
the monomer subunits have been found to dramatically alter dimer activity
(see Examples, below). In particular, certain Site 1/Site 2 heterodimer
sequences show agonist or antagonist activity at IR, depending on
orientation and linkage of the constituent monomer subunits. For example,
a Site 1-Site 2 orientation (C-N linkage), e.g., the S453 heterodimer, shows
antagonist activity at IR (Figure 18A; Table 7). In contrast, a Site 2-Site 1
orientation (C-N linkage), e.g., the S455 heterodimer, shows potent agonist
activity at IR (Figure 18D; Table 7). Similarly, Site 1-Site 2 (C-N linkage)
heterodimers, e.g., S425 and S459, show antagonist activity at IR (Table 7),
while Site 1-Site 2 (C-C or N-N linkage) heterodimers, e.g., S432-S438,
S454, and S456, show agonist activity (Table 7).
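
For illustration, the design space described above (two orientations and
four linkage types) can be enumerated, with each combination annotated by
the activity at IR that the preceding paragraph states for its example
heterodimers. This is a minimal sketch only; the names ORIENTATIONS,
LINKAGES, REPORTED_ACTIVITY and dimer_designs are hypothetical, and
combinations not mentioned in the paragraph are marked "not stated" rather
than inferred.

```python
from itertools import product

ORIENTATIONS = ("Site 1-Site 2", "Site 2-Site 1")
LINKAGES = ("N-N", "C-C", "N-C", "C-N")

# Activity at IR as stated in the text for example heterodimers.
REPORTED_ACTIVITY = {
    ("Site 1-Site 2", "C-N"): "antagonist",  # e.g., S453, S425, S459
    ("Site 2-Site 1", "C-N"): "agonist",     # e.g., S455
    ("Site 1-Site 2", "C-C"): "agonist",     # text groups C-C and N-N examples
    ("Site 1-Site 2", "N-N"): "agonist",     # (S432-S438, S454, S456)
}


def dimer_designs():
    """List every orientation/linkage pair with its stated activity, if any."""
    return [
        (orientation, linkage,
         REPORTED_ACTIVITY.get((orientation, linkage), "not stated"))
        for orientation, linkage in product(ORIENTATIONS, LINKAGES)
    ]


if __name__ == "__main__":
    for design in dimer_designs():
        print(design)
```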

Whether produced by recombinant gene expression or by
conventional chemical linkage technology, the various amino acid
sequences may be coupled through linkers of various lengths. Where linked
sequences are expressed recombinantly, and based on an average amino
acid length of about 4 angstroms, the linkers for connecting the two amino
acid sequences would typically range from about 3 to about 12 amino acids,
corresponding to from about 12 to about 48 Å. Accordingly, the preferred
distance between binding sequences is from about 2 to about 50 Å. More
preferred is about 4 to about 40 Å. The degree of flexibility of the linker
between the amino acid sequences may be modulated by the choice of amino
acids used to construct the linker. The combination of glycine and serine is
useful for producing a flexible, relatively unrestrictive linker. A more rigid
linker may be constructed by using amino acids with more complex side chains
within the linkage sequence.
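
The arithmetic relating linker composition to approximate linker length can
be sketched as follows, using the (GGS)n linker described for the fusion
construct and the stated average of about 4 angstroms per amino acid. The
function names are hypothetical, and the per-residue figure is the
approximation given in the text rather than a measured value.

```python
ANGSTROMS_PER_RESIDUE = 4.0  # approximate average stated in the text


def ggs_linker(n):
    """Return the (GGS)n linker sequence for a positive repeat count n."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return "GGS" * n


def linker_span_angstroms(linker):
    """Estimate linker length from its residue count."""
    return len(linker) * ANGSTROMS_PER_RESIDUE


def join_peptides(first, second, n):
    """Concatenate two peptide sequences through a (GGS)n linker."""
    return first + ggs_linker(n) + second


if __name__ == "__main__":
    for n in (1, 2, 3, 4):
        linker = ggs_linker(n)
        print(n, linker, len(linker), linker_span_angstroms(linker))
```

For n of 1 to 4 this reproduces the 3 to 12 residue linkers and the range
of about 12 to about 48 angstroms given above.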

2. Characterization Of Specific Dimers 

Specific dimers which are comprised of monomer subunits that both
bind with high affinity to the same site on IR (i.e., homodimers), or monomer
subunits that bind to different sites on IR (i.e., heterodimers) are disclosed
herein.

Other combinations of peptides are within the scope of this invention 
and may be determined as demonstrated in the examples described herein. 



WO 03/070747 



PCT/US02/30312



-47- 

F. Peptide Synthesis 

Many conventional techniques in molecular biology, protein
biochemistry, and immunology may be used to produce the amino acid
sequences for use with this invention. The present invention encompasses
the specific amino acid sequences shown in Figures 1-4, 8, and 9 and Table
7, inter alia, without additions (e.g., linker or spacer sequences), deletions,
alterations, or modification. The present invention further encompasses
variants that include additional sequences, altered sequences, and
functional fragments thereof. In a preferred embodiment, the amino acid
sequence variant or fragment shares at least one functional characteristic
(e.g., binding, agonist, or antagonist activity) of the reference sequence.
Variant peptides include, for example, genetically engineered mutants, and
may differ from the amino acid sequences shown in the figures and tables of
the application by the addition, deletion, or substitution of one or more
amino acid residues. Alterations may occur at the amino- or carboxy-
terminal positions of the reference amino acid sequence or anywhere
between those terminal positions, interspersed either individually among the
amino acids in the reference sequence or in one or more contiguous groups
within the reference sequence. In addition, variants may comprise synthetic
or non-naturally occurring amino acids in accordance with this invention.

Variant amino acid sequences can have conservative changes,
wherein a substituted amino acid has similar structural or chemical
properties, e.g., replacement of leucine with isoleucine. More infrequently, a
variant peptide can have non-conservative changes, e.g., substitution of a
glycine with a tryptophan. Guidance in determining which amino acid
residues can be substituted, inserted, or deleted without abolishing binding
or biological activity can be found using computer programs well known in
the art, for example, DNASTAR software (DNASTAR, Inc., Madison, WI).
Guidance is also provided by the data disclosed herein. In particular,
Figures 1-4, 8, 9, 43, 44, and Table 7, inter alia, teach which amino acid
residues can be deleted, added, substituted, or modified, while maintaining
the IR- or IGF-1R-related function(s) (e.g., binding, agonist, or antagonist
activity) of the amino acid sequences.

For the purposes of this invention, the amino acids are grouped as
follows: amino acids possessing alcohol groups are serine (S) and
threonine (T). Aliphatic amino acids are isoleucine (I), leucine (L), valine
(V), and methionine (M). Aromatic amino acids are phenylalanine (F),
histidine (H), tryptophan (W), and tyrosine (Y). Hydrophobic amino acids
are alanine (A), cysteine (C), phenylalanine (F), glycine (G), histidine (H),
isoleucine (I), leucine (L), methionine (M), arginine (R), threonine (T), valine
(V), tryptophan (W), and tyrosine (Y). Negative amino acids are aspartic
acid (D) and glutamic acid (E). The following amino acids are polar amino
acids: cysteine (C), aspartic acid (D), glutamic acid (E), histidine (H), lysine
(K), asparagine (N), glutamine (Q), arginine (R), serine (S), and threonine
(T). Positive amino acids are histidine (H), lysine (K), and arginine (R).
Small amino acids are alanine (A), cysteine (C), aspartic acid (D), glycine
(G), asparagine (N), proline (P), serine (S), threonine (T), and valine (V).
Very small amino acids are alanine (A), glycine (G) and serine (S). Amino
acids likely to be involved in a turn formation are alanine (A), cysteine (C),
aspartic acid (D), glutamic acid (E), glycine (G), histidine (H), lysine (K),
asparagine (N), glutamine (Q), arginine (R), serine (S), proline (P), and
threonine (T). As non-limiting examples, the amino acids within each of
these defined groups may be substituted for each other in the formulas
described above, as conservative substitutions, subject to the specific
preferences stated herein.
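
The groupings above lend themselves to a simple lookup. The sketch below
encodes each group as a set of one-letter codes and reports which groups two
residues share; the dictionary keys and function names are hypothetical.
Because the groups overlap broadly (for example, glycine and tryptophan are
both listed as hydrophobic), shared membership is only a coarse indicator and
remains subject to the specific preferences stated for each formula.

```python
# Group membership transcribed from the text; key names are illustrative.
AMINO_ACID_GROUPS = {
    "alcohol": set("ST"),
    "aliphatic": set("ILVM"),
    "aromatic": set("FHWY"),
    "hydrophobic": set("ACFGHILMRTVWY"),
    "negative": set("DE"),
    "polar": set("CDEHKNQRST"),
    "positive": set("HKR"),
    "small": set("ACDGNPSTV"),
    "very_small": set("AGS"),
    "turn": set("ACDEGHKNQRSPT"),
}


def shared_groups(residue_a, residue_b):
    """Return the sorted names of all groups containing both residues."""
    a, b = residue_a.upper(), residue_b.upper()
    return sorted(
        name for name, members in AMINO_ACID_GROUPS.items()
        if a in members and b in members
    )


def share_a_group(residue_a, residue_b):
    """True if the two residues fall in at least one common group."""
    return bool(shared_groups(residue_a, residue_b))


if __name__ == "__main__":
    print(shared_groups("L", "I"))
    print(share_a_group("D", "W"))
```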

Substantial changes in function can be made by selecting
substitutions that are less conservative than those shown in the defined
groups, above. For example, non-conservative substitutions can be made
which more significantly affect the structure of the peptide in the area of the
alteration, for example, the alpha-helical, or beta-sheet structure; the charge
or hydrophobicity of the molecule at the target site; or the bulk of the side
chain. The substitutions which generally are expected to produce the
greatest changes in the peptide's properties are those where 1) a
hydrophilic residue, e.g., seryl or threonyl, is substituted for (or by) a
hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl, or alanyl; 2)
a cysteine or proline is substituted for (or by) any other residue; 3) a residue
having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is
substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl;
or 4) a residue having a bulky side chain, e.g., phenylalanine, is substituted
for (or by) a residue that does not have a side chain, e.g., glycine.

Amino acid preferences have been identified for certain peptides and
peptide groups of the present invention. For example, amino acid
preferences for the RP9, D8, and Group 6 (Formula 10) peptides are shown
in Tables 17-19, below.

Variants also include amino acid sequences in which one or more
residues are modified (e.g., by phosphorylation, sulfation, acylation, or
PEGylation), and mutants comprising one or more modified residues.
Amino acid sequences may also be modified with a label capable of
providing a detectable signal, either directly or indirectly, including, but not
limited to, radioisotope, fluorescent, and enzyme labels. Fluorescent labels
include, for example, Cy3, Cy5, Alexa, BODIPY, fluorescein (e.g., FluorX,
DTAF, and FITC), rhodamine (e.g., TRITC), auramine, Texas Red, AMCA
blue, and Lucifer Yellow. Preferred isotope labels include 3H, 14C, 32P,
35S, 36Cl, 51Cr, 57Co, 58Co, 59Fe, 90Y, 125I, 131I, and 186Re. Preferred
enzyme labels include peroxidase, β-glucuronidase, β-D-glucosidase,
β-D-galactosidase, urease, glucose oxidase plus peroxidase, and alkaline
phosphatase (see, e.g., U.S. Pat. Nos. 3,654,090; 3,850,752 and
4,016,043). Enzymes can be conjugated by reaction with bridging
molecules such as carbodiimides, diisocyanates, glutaraldehyde, and the
like. Enzyme labels can be detected visually, or measured by colorimetric,
spectrophotometric, fluorospectrophotometric, amperometric, or gasometric
techniques. Other labeling systems, such as avidin/biotin and Tyramide Signal
Amplification (TSA™), are known in the art, and are commercially available
(see, e.g., ABC kit, Vector Laboratories, Inc., Burlingame, CA; NEN® Life
Science Products, Inc., Boston, MA).

1. Recombinant Synthesis of Peptides

To obtain recombinant peptides, DNA sequences encoding these
peptides may be cloned into any suitable vectors for expression in intact
host cells or in cell-free translation systems by methods well known in the art
(see Sambrook et al., 1989). The particular choice of the vector, host, or
translation system is not critical to the practice of the invention.

A large number of vectors, including bacterial, yeast, and mammalian
vectors, have been described for replication and/or expression in various
host cells or cell-free systems, and may be used for gene therapy as well as
for simple cloning or protein expression. In one aspect of the present
invention, an expression vector comprises a nucleic acid encoding an IR or
IGF-1R agonist or antagonist peptide, as described herein, operably linked
to at least one regulatory sequence. Regulatory sequences are known in
the art and are selected to direct expression of the desired protein in an 
appropriate host cell. Accordingly, the term regulatory sequence includes 
promoters, enhancers and other expression control elements (see D.V. 
Goeddel (1990) Methods Enzymol. 185:3-7). Enhancer and other
expression control sequences are described in Enhancers and Eukaryotic
Gene Expression, Cold Spring Harbor Press, Cold Spring Harbor, NY 
(1983). It should be understood that the design of the expression vector 
may depend on such factors as the choice of the host cell to be transfected 
and/or the type of peptide desired to be expressed. 

Several regulatory elements (e.g., promoters) have been isolated and
shown to be effective in the transcription and translation of heterologous
proteins in the various hosts. Such regulatory regions, methods of isolation,
manner of manipulation, etc. are known in the art. Non-limiting examples of
bacterial promoters include the β-lactamase (penicillinase) promoter; lactose
promoter; tryptophan (trp) promoter; araBAD (arabinose) operon promoter;
lambda-derived PL promoter and N gene ribosome binding site; and the
hybrid tac promoter derived from sequences of the trp and lac UV5
promoters. Non-limiting examples of yeast promoters include the
3-phosphoglycerate kinase promoter, glyceraldehyde-3-phosphate
dehydrogenase (GAPDH) promoter, galactokinase (GAL1) promoter,
galactoepimerase promoter, and alcohol dehydrogenase (ADH1) promoter. 
Suitable promoters for mammalian cells include, without limitation, viral 
promoters, such as those from Simian Virus 40 (SV40), Rous sarcoma virus 
(RSV), adenovirus (ADV), and bovine papilloma virus (BPV). Preferred
replication and inheritance systems include M13, ColE1, SV40, baculovirus,
lambda, adenovirus, CEN ARS, 2μm ARS and the like. While expression
vectors may replicate autonomously, they may also replicate by being 
inserted into the genome of the host cell, by methods well known in the art. 
To obtain expression in eukaryotic cells, terminator sequences,
polyadenylation sequences, and enhancer sequences that modulate gene
expression may be required. Sequences that cause amplification of the 
gene may also be desirable. Furthermore, sequences that facilitate 
secretion of the recombinant product from cells, including, but not limited to, 
bacteria, yeast, and animal cells, such as secretory signal sequences and/or
preprotein or proprotein sequences, may also be included. These
sequences are well described in the art. DNA sequences can be optimized, 
if desired, for more efficient expression in a given host organism or 
expression system. For example, codons can be altered to conform to the 
preferred codon usage in a given host cell or cell-free translation system
using well-established techniques.

Codon usage data can be obtained from publicly-available sources, 
for example, the Codon Usage Database at http://www.kazusa.or.jp/codon/. 
In addition, computer programs that translate amino acid sequence 
information into nucleotide sequence information in accordance with codon
preferences (i.e., backtranslation programs) are widely available. See, for
example, Backtranslate program from Genetics Computer Group (GCG),
Accelrys, Inc., Madison, WI; and Backtranslation Applet from Entelechon
GmbH, Regensburg, Germany. Thus, using the peptide sequences
disclosed herein, one of ordinary skill in the art can design nucleic acids to
yield optimal expression levels in the translation system or host cell of
choice.
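
The backtranslation step described above can be sketched in a few lines of Python. This is a minimal illustration, not the GCG or Entelechon program: the function names are hypothetical, and the usage frequencies in `example_usage` are placeholder values chosen for illustration, not data from any codon usage database. Only the standard genetic code table is factual.

```python
# Sketch: back-translate a peptide into DNA using the most-used codon per residue.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def backtranslate(peptide, usage):
    """Choose, for each residue, the synonymous codon with the highest usage value.

    `usage` maps codon -> relative frequency in the chosen host; codons that
    are missing from it are treated as frequency 0.
    """
    dna = []
    for residue in peptide.upper():
        synonymous = [c for c, aa in CODON_TABLE.items() if aa == residue]
        if not synonymous:
            raise ValueError(f"unknown residue: {residue!r}")
        dna.append(max(synonymous, key=lambda c: usage.get(c, 0.0)))
    return "".join(dna)


def translate(dna):
    """Translate a DNA string (length divisible by 3) back into one-letter residues."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# HYPOTHETICAL usage values, for illustration only.
example_usage = {"CTG": 0.5, "TTA": 0.1, "AAA": 0.7, "AAG": 0.3, "GAA": 0.7, "GAG": 0.3}

# DYKDDDDK is the epitope tag sequence named elsewhere in this section.
peptide = "DYKDDDDK"
dna = backtranslate(peptide, example_usage)
assert translate(dna) == peptide  # round trip recovers the peptide
```

In practice the `usage` mapping would be filled from a table for the intended host, and the round-trip check guards against an error in the codon table.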

Expression and cloning vectors will likely contain a selectable marker, 
a gene encoding a protein necessary for survival or growth of a host cell 
transformed with the vector. The presence of this gene ensures growth of 
only those host cells that express the inserts. Typical selection genes
encode proteins that 1) confer resistance to antibiotics or other toxic
substances, e.g., ampicillin, neomycin, methotrexate, etc.; 2) complement
auxotrophic deficiencies; or 3) supply critical nutrients not available from
complex media, e.g., the gene encoding D-alanine racemase for Bacilli.
Markers may be an inducible or non-inducible gene and will generally allow
for positive selection. Non-limiting examples of markers include the
ampicillin resistance marker (i.e., beta-lactamase), tetracycline resistance
marker, neomycin/kanamycin resistance marker (i.e., neomycin
phosphotransferase), dihydrofolate reductase, glutamine synthetase, and
the like. The choice of the proper selectable marker will depend on the host
cell, and appropriate markers for different hosts are understood by those of
skill in the art.

Suitable expression vectors for use with the present invention 
include, but are not limited to, pUC, pBluescript (Stratagene), pET 
(Novagen, Inc., Madison, WI), and pREP (Invitrogen) plasmids. Vectors can
contain one or more replication and inheritance systems for cloning or
expression, one or more markers for selection in the host, e.g., antibiotic
resistance, and one or more expression cassettes. The inserted coding
sequences can be synthesized by standard methods, isolated from natural
sources, or prepared as hybrids. Ligation of the coding sequences to
transcriptional regulatory elements (e.g., promoters, enhancers, and/or
insulators) and/or to other amino acid encoding sequences can be carried
out using established methods. 

Suitable cell-free expression systems for use with the present 
invention include, without limitation, rabbit reticulocyte lysate, wheat germ 
extract, canine pancreatic microsomal membranes, E. coli S30 extract, and
coupled transcription/translation systems (Promega Corp., Madison, WI).
These systems allow the expression of recombinant peptides upon the 
addition of cloning vectors, DNA fragments, or RNA sequences containing 
protein-coding regions and appropriate promoter elements. 

Non-limiting examples of suitable host cells include bacteria, archaea,
insect, fungi (e.g., yeast), plant, and animal cells (e.g., mammalian,
especially human). Of particular interest are Escherichia coli, Bacillus
subtilis, Saccharomyces cerevisiae, SF9 cells, C129 cells, 293 cells,
Neurospora, and immortalized mammalian myeloid and lymphoid cell lines.
Techniques for the propagation of mammalian cells in culture are well
known (see, Jakoby and Pastan (Eds), 1979, Cell Culture. Methods in
Enzymology, volume 58, Academic Press, Inc., Harcourt Brace Jovanovich,
NY). Examples of commonly used mammalian host cell lines are VERO and
HeLa cells, CHO cells, and WI38, BHK, and COS cell lines, although it will
be appreciated by the skilled practitioner that other cell lines may be used,
e.g., to provide higher expression, or other features. 

Host cells can be transformed, transfected, or infected as appropriate 
by any suitable method including electroporation, calcium chloride-, lithium 
chloride-, lithium acetate/polyethylene glycol-, calcium phosphate-,
DEAE-dextran-, or liposome-mediated DNA uptake, spheroplasting, injection,
microinjection, microprojectile bombardment, phage infection, viral infection,
or other established methods. Alternatively, vectors containing the nucleic
acids of interest can be transcribed in vitro, and the resulting RNA
introduced into the host cell by well-known methods, e.g., by injection (see,
Kubo et al., 1988, FEBS Letts. 241:119). The cells into which the nucleic
acids described above have been introduced are meant to also include the
progeny of such cells.

Nucleic acids encoding the peptides of the invention may be isolated 
directly from recombinant phage libraries (e.g., RAPIDLIB® or GRABLIB® 
libraries) described herein. Alternatively, the polymerase chain reaction
(PCR) method can be used to produce nucleic acids of the invention, using
the recombinant phage libraries as templates. Primers used for PCR can be
synthesized using the sequence information provided herein and can further
be designed to introduce appropriate new restriction sites, if desirable, to
facilitate incorporation into a given vector for recombinant expression.
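
As a rough illustration of designing primers that introduce new restriction sites, the Python sketch below prepends a site to each primer. The function names, the 18-base annealing length, and the two-base "GC" clamp are assumptions made for this example. EcoRI (GAATTC) and BamHI (GGATCC) are used only as familiar sites, and the insert is an arbitrary placeholder rather than a sequence disclosed herein.

```python
# Sketch: add restriction sites to the 5' ends of PCR primers for cloning.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(coding_seq, fwd_site, rev_site, anneal_len=18, clamp="GC"):
    """Return (forward, reverse) primers, each written 5'->3'.

    Each primer is: clamp + restriction site + annealing region.
    The clamp and annealing length are illustrative defaults, not tuned values.
    """
    coding_seq = coding_seq.upper()
    if len(coding_seq) < anneal_len:
        raise ValueError("coding sequence is shorter than the annealing length")
    for site in (fwd_site, rev_site):
        # A site that also occurs inside the insert would be cut during cloning.
        if site in coding_seq or reverse_complement(site) in coding_seq:
            raise ValueError(f"site {site} also occurs within the insert")
    forward = clamp + fwd_site + coding_seq[:anneal_len]
    reverse = clamp + rev_site + reverse_complement(coding_seq[-anneal_len:])
    return forward, reverse


# Arbitrary placeholder insert (start codon, eight codons, stop codon).
insert = "ATGGATTATAAAGATGATGATGATAAATAA"
fwd, rev = design_primers(insert, "GAATTC", "GGATCC")
print(fwd)
print(rev)
```

A real design would also check melting temperature and primer self-complementarity, which this sketch leaves out.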

Nucleic acids encoding the peptides of the present invention can also 
be produced by chemical synthesis, e.g., by the phosphoramidite method 
described by Beaucage et al., 1981, Tetra. Letts. 22:1859-1862, or the
triester method according to Matteucci et al., 1981, J. Am. Chem. Soc.,
103:3185, and can be performed on commercial, automated oligonucleotide
synthesizers. A double-stranded fragment may be obtained from the single-
stranded product of chemical synthesis either by synthesizing the
complementary strand and annealing the strands together under appropriate
conditions or by adding the complementary strand using DNA polymerase
with an appropriate primer sequence.

The nucleic acids encoding the peptides of the invention can be 
produced in large quantities by replication in a suitable host cell. Natural or 
synthetic nucleic acid fragments, comprising at least ten contiguous bases 
coding for a desired amino acid sequence can be incorporated into
recombinant nucleic acid constructs, usually DNA constructs, capable of
introduction into and replication in a prokaryotic or eukaryotic cell. Usually
the nucleic acid constructs will be suitable for replication in a unicellular
host, such as yeast or bacteria, but may also be intended for introduction to
(with and without integration within the genome) cultured mammalian or
plant or other eukaryotic cells, cell lines, tissues, or organisms. The
purification of nucleic acids produced by the methods of the present
invention is described, for example, in Sambrook et al., 1989; F.M. Ausubel
et al., 1992, Current Protocols in Molecular Biology, J. Wiley and Sons, New
York, NY. 

These nucleic acids can encode variant or truncated forms of the 
peptides as well as the reference peptides shown in Figures 1-4, 8, and 9
and Table 7, inter alia. Large quantities of the nucleic acids and peptides of
the present invention may be prepared by expressing the nucleic acids or
portions thereof in vectors or other expression vehicles in compatible
prokaryotic or eukaryotic host cells. The most commonly used prokaryotic
hosts are strains of Escherichia coli, although other prokaryotes, such as
Bacillus subtilis or Pseudomonas may also be used. Mammalian or other
eukaryotic host cells, such as those of yeast, filamentous fungi, plant, insect,
or amphibian or avian species, may also be useful for production of the
proteins of the present invention. For example, insect cell systems (i.e.,
lepidopteran host cells and baculovirus expression vectors) are particularly
suited for large-scale protein production. 

Host cells carrying an expression vector (i.e., transformants or 
clones) are selected using markers depending on the mode of the vector 
construction. The marker may be on the same or a different DNA molecule,
preferably the same DNA molecule. In prokaryotic hosts, the transformant
may be selected, e.g., by resistance to ampicillin, tetracycline or other 
antibiotics. Production of a particular product based on temperature 
sensitivity may also serve as an appropriate marker. 

For some purposes, it is preferable to produce the peptide in a
recombinant system in which the peptide contains an additional sequence
(e.g., epitope or protein) tag that facilitates purification. Non-limiting
examples of epitope tags include c-myc, haemagglutinin (HA), polyhistidine
(6X-HIS) (SEQ ID NO:1778), GLU-GLU, and DYKDDDDK (SEQ ID
NO:1779) or DYKD (SEQ ID NO:1545; FLAG®) epitope tags. Non-limiting
examples of protein tags include glutathione-S-transferase (GST), green
fluorescent protein (GFP), and maltose binding protein (MBP). In one
approach, the coding sequence of a peptide can be cloned into a vector that
creates a fusion with a sequence tag of interest. Suitable vectors include,
without limitation, pRSET (Invitrogen Corp., San Diego, CA), pGEX
(Amersham Pharmacia Biotech, Inc., Piscataway, NJ), pEGFP (CLONTECH
Laboratories, Inc., Palo Alto, CA), and pMAL™ (New England BioLabs, Inc.,
Beverly, MA) plasmids. Following expression, the epitope or protein tagged
peptide can be purified from a crude lysate of the translation system or host
cell by chromatography on an appropriate solid-phase matrix. In some
cases, it may be preferable to remove the epitope or protein tag (i.e., via
protease cleavage) following purification.

Methods for directly purifying peptides from sources such as cellular 
or extracellular lysates are well known in the art (see Harris and Angal, 
1989). Such methods include, without limitation, sodium dodecylsulfate-
polyacrylamide gel electrophoresis (SDS-PAGE), preparative disc-gel
electrophoresis, isoelectric focusing, high-performance liquid
chromatography (HPLC), reversed-phase HPLC, gel filtration, ion exchange
and partition chromatography, countercurrent distribution, and combinations
thereof. Peptides can be purified from many possible sources, for example,
plasma, body tissues, or body fluid lysates derived from human or animal,
including mammalian, bird, fish, and insect sources.

Antibody-based methods may also be used to purify peptides. 
Antibodies that recognize these peptides or fragments derived therefrom 
can be produced and isolated. The peptide can then be purified from a 
crude lysate by chromatography on an antibody-conjugated solid-phase
matrix (see Harlow and Lane, 1998).

2. Chemical Synthesis Of Peptides 

Alternately, peptides may be chemically synthesized by commercially 
available automated procedures, including, without limitation, exclusive solid 
phase synthesis, partial solid phase methods, fragment condensation or 
classical solution synthesis. The peptides are preferably prepared by solid-
phase peptide synthesis; for example, as described by Merrifield (1965;
1997).

According to methods known in the art, peptides can be chemically 
synthesized by commercially available automated procedures, including, 
without limitation, exclusive solid phase synthesis, partial solid phase
methods, fragment condensation, and classical solution synthesis. In addition,
recombinant and synthetic methods of peptide production can be combined
to produce semi-synthetic peptides. The peptides of the invention are
preferably prepared by solid phase peptide synthesis as described by
Merrifield, 1963, J. Am. Chem. Soc. 85:2149; 1997. In one embodiment,
synthesis is carried out with amino acids that are protected at the alpha-
amino terminus. Trifunctional amino acids with labile side-chains are also
protected with suitable groups to prevent undesired chemical reactions from
occurring during the assembly of the peptides. The alpha-amino protecting
group is selectively removed to allow subsequent reaction to take place at
the amino-terminus. The conditions for the removal of the alpha-amino 
protecting group do not remove the side-chain protecting groups. 

The alpha-amino protecting groups are those known to be useful in 
the art of stepwise peptide synthesis. Included are acyl type protecting
groups, e.g., formyl, trifluoroacetyl, acetyl; aromatic urethane type protecting
groups, e.g., benzyloxycarbonyl (Cbz), substituted benzyloxycarbonyl and
9-fluorenylmethyloxycarbonyl (Fmoc); aliphatic urethane protecting groups,
e.g., t-butyloxycarbonyl (Boc), isopropyloxycarbonyl, cyclohexyloxycarbonyl;
and alkyl type protecting groups, e.g., benzyl, triphenylmethyl. The
preferred protecting group is Boc. The side-chain protecting groups for Tyr
include tetrahydropyranyl, tert-butyl, trityl, benzyl, Cbz, 4-Br-Cbz and
2,6-dichlorobenzyl. The preferred side-chain protecting group for Tyr is
2,6-dichlorobenzyl. The side-chain protecting groups for Asp include benzyl,
2,6-dichlorobenzyl, methyl, ethyl, and cyclohexyl. The preferred side-chain
protecting group for Asp is cyclohexyl. The side-chain protecting groups for
Thr and Ser include acetyl, benzoyl, trityl, tetrahydropyranyl, benzyl,
2,6-dichlorobenzyl, and Cbz. The preferred protecting group for Thr and Ser is
benzyl. The side-chain protecting groups for Arg include nitro, Tos, Cbz,
adamantyloxycarbonyl, and Boc. The preferred protecting group for Arg is
Tos. The side-chain amino group of Lys can be protected with Cbz,
2-Cl-Cbz, Tos, or Boc. The 2-Cl-Cbz group is the preferred protecting group for
Lys. 

The side-chain protecting groups selected must remain intact during 
coupling and not be removed during the deprotection of the amino-terminus 
protecting group or during coupling conditions. The side-chain protecting
groups must also be removable upon the completion of synthesis, using
reaction conditions that will not alter the finished peptide. 

Solid phase synthesis is usually carried out from the carboxy- 
terminus by coupling the alpha-amino protected (side-chain protected) 
amino acid to a suitable solid support. An ester linkage is formed when the
attachment is made to a chloromethyl or hydroxymethyl resin, and the
resulting peptide will have a free carboxyl group at the C-terminus.
Alternatively, when a benzhydrylamine or p-methylbenzhydrylamine resin is
used, an amide bond is formed and the resulting peptide will have a
carboxamide group at the C-terminus. These resins are commercially
available, and their preparation has been described by Stewart et al., 1984, Solid
Phase Peptide Synthesis (2nd Edition), Pierce Chemical Co., Rockford, IL. 

The C-terminal amino acid, protected at the side chain if necessary 
and at the alpha-amino group, is coupled to the benzhydrylamine resin using 
various activating agents including dicyclohexylcarbodiimide (DCC),
N,N'-diisopropyl-carbodiimide and carbonyldiimidazole. Following the attachment
to the resin support, the alpha-amino protecting group is removed using
trifluoroacetic acid (TFA) or HCl in dioxane at a temperature between 0 and
25°C. Dimethylsulfide is added to the TFA after the introduction of
methionine (Met) to suppress possible S-alkylation. After removal of the
alpha-amino protecting group, the remaining protected amino acids are
coupled stepwise in the required order to obtain the desired sequence.
Various activating agents can be used for the coupling reactions including
DCC, N,N'-diisopropyl-carbodiimide,
benzotriazol-1-yl-oxy-tris-(dimethylamino) phosphonium hexa-fluorophosphate
(BOP) and DCC-hydroxybenzotriazole (HOBt). Each protected amino acid is
used in excess (>2.0 equivalents), and the couplings are usually carried out in
N-methylpyrrolidone (NMP) or in DMF, CH2Cl2 or mixtures thereof. The
extent of completion of the coupling reaction is monitored at each stage,
e.g., by the ninhydrin reaction as described by Kaiser et al., 1970, Anal.
Biochem. 34:595. In cases where incomplete coupling is found, the
coupling reaction is repeated. The coupling reactions can be performed
automatically with commercially available instruments. 
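
The bookkeeping behind a stepwise assembly can be sketched as a simple lookup. The Python example below lists the coupling order (C-terminus first) with the preferred protecting groups named above. The function and dictionary names are hypothetical, and the sketch tracks only the order of addition and the protecting groups, not the chemistry.

```python
# Sketch: coupling order and preferred protecting groups for a stepwise synthesis.

# Preferred side-chain protecting groups as stated in the text above.
PREFERRED_SIDE_CHAIN_GROUP = {
    "Y": "2,6-dichlorobenzyl",  # Tyr
    "D": "cyclohexyl",          # Asp
    "T": "benzyl",              # Thr
    "S": "benzyl",              # Ser
    "R": "Tos",                 # Arg
    "K": "2-Cl-Cbz",            # Lys
}


def coupling_plan(peptide, alpha_amino_group="Boc"):
    """List the residues in the order they are coupled (C-terminus first)."""
    steps = []
    for position in range(len(peptide), 0, -1):
        residue = peptide[position - 1]
        steps.append({
            "position": position,  # 1-based, counted from the N-terminus
            "residue": residue,
            "alpha_amino": alpha_amino_group,
            # None means no preferred group is stated above for this residue;
            # it does not mean the residue needs no protection.
            "side_chain": PREFERRED_SIDE_CHAIN_GROUP.get(residue),
        })
    return steps


for step in coupling_plan("DYKDDDDK"):
    print(step["position"], step["residue"], step["alpha_amino"], step["side_chain"])
```

The first entry of the plan is the residue attached to the resin, and each later entry corresponds to one deprotection and coupling cycle.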

After the entire assembly of the desired peptide, the peptide-resin is 
cleaved with a reagent such as liquid HF for 1-2 hours at 0°C, which cleaves 
the peptide from the resin and removes all side-chain protecting groups. A
scavenger such as anisole is usually used with the liquid HF to prevent
cations formed during the cleavage from alkylating the amino acid residues 
present in the peptide. The peptide-resin can be deprotected with 
TFA/dithioethane prior to cleavage if desired. 

Side-chain to side-chain cyclization on the solid support requires the
use of an orthogonal protection scheme which enables selective cleavage of
the side-chain functions of acidic amino acids (e.g., Asp) and the basic
amino acids (e.g., Lys). The 9-fluorenylmethyl (Fm) protecting group for the
side-chain of Asp and the 9-fluorenylmethyloxycarbonyl (Fmoc) protecting
group for the side-chain of Lys can be used for this purpose. In these
cases, the side-chain protecting groups of the Boc-protected peptide-resin
are selectively removed with piperidine in DMF. Cyclization is achieved on
the solid support using various activating agents including DCC, DCC/HOBt,
or BOP. The HF reaction is carried out on the cyclized peptide-resin as
described above.

3. Peptide Libraries

Peptide libraries produced and screened according to the present 
invention are useful in providing new ligands for IR and IGF-1R. Peptide 
libraries can be designed and panned according to methods described in 
detail herein, and methods generally available to those in the art (see, e.g.,
U.S. Patent No. 5,723,286 issued March 3, 1998 to Dower et al.). In one
aspect, commercially available phage display libraries can be used (e.g.,
RAPIDLIB® or GRABLIB®, DGI BioTechnologies, Inc., Edison, NJ; Ph.D.
C7C Disulfide Constrained Peptide Library, New England Biolabs). In
another aspect, an oligonucleotide library can be prepared according to
methods known in the art, and inserted into an appropriate vector for peptide
expression. For example, vectors encoding a bacteriophage structural
protein, preferably an accessible phage protein, such as a bacteriophage
coat protein, can be used. Although one skilled in the art will appreciate that
a variety of bacteriophage may be employed in the present invention, in
preferred embodiments the vector is, or is derived from, a filamentous
bacteriophage, such as, for example, f1, fd, Pf1, M13, etc. In particular, the
fd-tet vector has been extensively described in the literature (see, e.g.,
Zacher et al., 1980, Gene 9:127-140; Smith et al., 1985, Science
228:1315-1317; Parmley and Smith, 1988, Gene 73:305-318).

The phage vector is chosen to contain or is constructed to contain a 
cloning site located in the 5' region of the gene encoding the bacteriophage 
structural protein, so that the peptide is accessible to receptors in an affinity 
enrichment procedure as described hereinbelow. The structural phage
protein is preferably a coat protein. An example of an appropriate coat
protein is pIII. A suitable vector may allow oriented cloning of the
oligonucleotide sequences that encode the peptide so that the peptide is 
expressed at or within a distance of about 100 amino acid residues of the N-
terminus of the mature coat protein. The coat protein is typically expressed
as a preprotein, having a leader sequence.

Thus, desirably the oligonucleotide library is inserted so that the N-
terminus of the processed bacteriophage outer protein is the first residue of
the peptide, i.e., between the 3'-terminus of the sequence encoding the
leader protein and the 5'-terminus of the sequence encoding the mature
protein or a portion of the 5' terminus. The library is constructed by cloning
an oligonucleotide which contains the variable region of library members
(and any spacers, as discussed below) into the selected cloning site. Using
known recombinant DNA techniques (see generally, Sambrook et al., 1989,
Molecular Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, N.Y., 1989), an oligonucleotide may
be constructed which, inter alia: 1) removes unwanted restriction sites and
adds desired ones; 2) reconstructs the correct portions of any sequences
which have been removed (such as a correct signal peptidase site, for
example); 3) inserts the spacer residues, if any; and/or 4) corrects the
translation frame (if necessary) to produce active, infective phage.

The central portion of the oligonucleotide will generally contain one or 
more IR and/or IGF-1R binding sequences and, optionally, spacer 
sequences. The sequences are ultimately expressed as peptides (with or 
without spacers) fused to or in the N-terminus of the mature coat protein on
the outer, accessible surface of the assembled bacteriophage particles. The
size of the library will vary according to the number of variable codons, and
hence the size of the peptides, which are desired. Generally the library will
be at least about 10⁶ members, usually at least 10⁷, and typically 10⁸ or
more members. To generate the collection of oligonucleotides which forms
a series of codons encoding a random collection of amino acids and which
is ultimately cloned into the vector, a codon motif is used, such as (NNK)ₓ,
where N may be A, C, G, or T (nominally equimolar), K is G or T (nominally
equimolar), and x is typically up to about 5, 6, 7, 8, or more, thereby
producing libraries of penta-, hexa-, hepta-, and octa-peptides or larger.
The third position may also be G or C, designated "S". Thus, NNK or NNS
1) code for all the amino acids; 2) code for only one stop codon; and 3)
reduce the range of codon bias from 6:1 to 3:1.

It should be understood that, with longer peptides, the size of the 
library that is generated may become a constraint in the cloning process. 
The expression of peptides from randomly generated mixtures of
oligonucleotides in appropriate recombinant vectors is known in the art (see,
e.g., Oliphant et al., Gene 44:177-183). For example, each position of the
codon motif (NNK)₆ can be any of 32 codons: one for each of 12 amino acids,
two for each of five amino acids, three for each of three amino acids, and one
(amber) stop codon. Although this motif produces a codon distribution as
equitable as available with standard methods of oligonucleotide synthesis, it
results in a bias against peptides containing one-codon residues. In particular, a
complete collection of hexacodons contains one sequence encoding each
peptide made up of only one-codon amino acids, but contains 729 (3⁶)
sequences encoding each peptide with only three-codon amino acids.
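
The codon counts quoted for the NNK motif can be checked by enumerating the codons against the standard genetic code. The Python sketch below does this for NNK, NNS, and the unrestricted NNN motif. The function names are hypothetical, and the third-position letter sets are passed as plain strings.

```python
# Sketch: enumerate a degenerate codon motif and tally codons per amino acid.
from collections import Counter
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def summarize(third_position):
    """Tally a motif whose first two positions are N and whose third is restricted.

    Returns (codons, amino acids covered, stop codons, {codons per residue: residues}).
    """
    codons = ["".join(c) for c in product("ACGT", "ACGT", third_position)]
    per_residue = Counter(CODON_TABLE[c] for c in codons)
    stops = per_residue.pop("*", 0)
    multiplicity = Counter(per_residue.values())
    return len(codons), len(per_residue), stops, dict(multiplicity)


nnk = summarize("GT")    # K = G or T
nns = summarize("GC")    # S = G or C
nnn = summarize("ACGT")  # unrestricted third position

for name, result in (("NNK", nnk), ("NNS", nns), ("NNN", nnn)):
    codons, residues, stops, multiplicity = result
    bias = max(multiplicity) // min(multiplicity)
    print(f"{name}: {codons} codons, {residues} amino acids, {stops} stop(s), bias {bias}:1")

# Sequences encoding a hexapeptide built only from three-codon residues.
print(3 ** 6)
```

Under these definitions both NNK and NNS give 32 codons with a single stop codon, and the most-represented residues have three codons against one for the least-represented, compared with six against one for NNN.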

An alternative approach to minimize the bias against one-codon 
residues involves the synthesis of 20 activated trinucleotides, each 
representing the codon for one of the 20 genetically encoded amino acids. 
These are synthesized by conventional means, removed from the support
while maintaining the base and 5'-OH-protecting groups, and activated by
the addition of 3'-O-phosphoramidite (and phosphate protection with
β-cyanoethyl groups) by the method used for the activation of
mononucleosides (see, generally, McBride and Caruthers, 1983,
Tetrahedron Letters 22:245). Degenerate oligocodons are prepared using
these trimers as building blocks. The trimers are mixed at the desired molar
ratios and installed in the synthesizer. The ratios will usually be
approximately equimolar, but may be a controlled unequal ratio to obtain the
over- to under-representation of certain amino acids coded for by the
degenerate oligonucleotide collection. The condensation of the trimers to
form the oligocodons is done essentially as described for conventional
synthesis employing activated mononucleosides as building blocks (see,
e.g., Atkinson and Smith, 1984, Oligonucleotide Synthesis, M.J. Gait, Ed., p.
35-82). This procedure generates a population of oligonucleotides for
cloning that is capable of encoding an equal distribution (or a controlled
unequal distribution) of the possible peptide sequences. Advantageously,
this approach may be employed in generating longer peptide sequences,
since the range of bias produced by the (NNK)₆ motif increases by three-fold
with each additional amino acid residue.

When the codon motif is (NNK)ₓ, as defined above, and when x
equals 8, there are 2.6 × 10¹⁰ possible octa-peptides. A library containing
most of the octa-peptides may be difficult to produce. Thus, a sampling of
the octa-peptides may be accomplished by constructing a subset library
using up to about 10% of the possible sequences, which subset of
recombinant bacteriophage particles is then screened. If desired, to extend
the diversity of a subset library, the recovered phage subset may be
subjected to mutagenesis and then subjected to subsequent rounds of
screening. This mutagenesis step may be accomplished in two general 
ways: the variable region of the recovered phage may be mutagenized, or 
additional variable amino acids may be added to the regions adjoining the 
initial variable sequences. 
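
The library sizes discussed above follow from simple counting, as the Python sketch below shows. The function names are hypothetical, and the coverage figure is an upper bound that assumes every clone in the library encodes a different peptide.

```python
# Sketch: theoretical diversity of an (NNK)x library and the coverage of a subset.

AMINO_ACIDS = 20   # genetically encoded amino acids
NNK_CODONS = 32    # 4 * 4 * 2 codons available at each NNK position


def peptide_diversity(x):
    """Number of distinct peptides of length x."""
    return AMINO_ACIDS ** x


def codon_diversity(x):
    """Number of distinct (NNK)x DNA sequences, including those with a stop codon."""
    return NNK_CODONS ** x


def max_coverage(clones, x):
    """Upper bound on the fraction of possible peptides present in `clones` members."""
    return min(1.0, clones / peptide_diversity(x))


x = 8
print(f"octa-peptides:       {peptide_diversity(x):.2e}")
print(f"(NNK)8 DNA sequences: {codon_diversity(x):.2e}")
print(f"10% subset:          {peptide_diversity(x) // 10:.2e} sequences")
print(f"coverage of a 1e8-member library: {max_coverage(10**8, x):.4%}")
```

Because several codons encode the same residue and clones can repeat, the number of clones needed to sample a given fraction of the octa-peptides is larger than this bound suggests.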

To diversify around active peptides (i.e., binders) found in early
rounds of panning, the positive phage can be sequenced to determine the
identity of the active peptides. Oligonucleotides can then be synthesized
based on these peptide sequences. The syntheses are done with a low
level of all bases incorporated at each step to produce slight variations of
the primary oligonucleotide sequences. This mixture of (slightly) degenerate
oligonucleotides can then be cloned into the affinity phage by methods
known to those in the art. This method produces systematic, controlled
variations of the starting peptide sequences as part of a secondary library. It
requires, however, that individual positive phage be sequenced before
mutagenesis, and thus is useful for expanding the diversity of small numbers
of recovered phage. 
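
The effect of synthesizing with a low level of all bases at each step can be modeled in silico. The Python sketch below is a simplified simulation under stated assumptions: each position keeps the parental base with probability 1 - doping and otherwise receives one of the other three bases with equal probability. The function names, the 3% doping level, and the parent sequence are all illustrative choices, not values from this document.

```python
# Sketch: simulate a slightly degenerate ("doped") oligonucleotide pool.
import random


def doped_oligo(parent, doping=0.03, rng=None):
    """Return one variant of `parent` with per-position substitution rate `doping`."""
    if rng is None:
        rng = random.Random()
    variant = []
    for base in parent.upper():
        if rng.random() < doping:
            # Substitute one of the three non-parental bases, chosen uniformly.
            variant.append(rng.choice([b for b in "ACGT" if b != base]))
        else:
            variant.append(base)
    return "".join(variant)


def secondary_library(parent, size, doping=0.03, seed=None):
    """Generate `size` variants of `parent`; `seed` makes the result reproducible."""
    rng = random.Random(seed)
    return [doped_oligo(parent, doping, rng) for _ in range(size)]


# Arbitrary placeholder parent sequence (24 bases, i.e., eight codons).
parent = "GATTATAAAGATGATGATGATAAA"
pool = secondary_library(parent, size=1000, seed=0)
changes = [sum(a != b for a, b in zip(v, parent)) for v in pool]
print("mean substitutions per oligo:", sum(changes) / len(changes))
```

With these assumptions the expected number of substitutions per oligonucleotide is the doping rate times the length, so the doping level sets how far the secondary library strays from the starting sequence.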

An alternate approach to diversify the selected phage allows the
mutagenesis of a pool, or subset, of recovered phage. In accordance with
this approach, phage recovered from panning are pooled and single
stranded DNA is isolated. The DNA is mutagenized by treatment with, e.g.,
nitrous acid, formic acid, or hydrazine. These treatments produce a variety
of damage to the DNA. The damaged DNA is then copied with reverse
transcriptase, which misincorporates bases when it encounters a site of
damage. The segment containing the sequence encoding the receptor-
binding peptide is then isolated by cutting with restriction nuclease(s)
specific for sites flanking the peptide coding sequence. This mutagenized
segment is then recloned into undamaged vector DNA, the DNA is
transformed into cells, and a secondary library is generated according to known
methods. General mutagenesis methods are known in the art (see Myers et al., 1985,
Nucl. Acids Res. 13:3131-3145; Myers et al., 1985, Science 229:242-246;
Myers, 1989, Current Protocols in Molecular Biology Vol. 1, 8.3.1-8.3.6, F.
Ausubel et al., eds, J. Wiley and Sons, New York).

In another general approach, the addition of amino acids to a peptide 
or peptides found to be active, can be carried out using various methods. In 
one, the sequences of peptides selected in early panning are determined
individually and new oligonucleotides, incorporating the determined
sequence and an adjoining degenerate sequence, are synthesized. These
are then cloned to produce a secondary library. Alternatively, methods can
be used to add a second IR or IGF-1R binding sequence to a pool of
peptide-bearing phage. In accordance with one method, a restriction site is
installed next to the first IR or IGF-1R binding sequence. Preferably, the
enzyme should cut outside of its recognition sequence. The recognition site
may be placed several bases from the first binding sequence. To insert a
second IR or IGF-1R binding sequence, the pool of phage DNA is digested
and blunt-ended by filling in the overhang with Klenow fragment. Double-
stranded, blunt-ended, degenerately synthesized oligonucleotides are then
ligated into this site to produce a second binding sequence juxtaposed to the
first binding sequence. This secondary library is then amplified and
screened as before. 

While in some instances it may be appropriate to synthesize longer
peptides to bind certain receptors, in other cases it may be desirable to
provide peptides having two or more IR or IGF-1R binding sequences
separated by spacer (e.g., linker) residues. For example, the binding
sequences may be separated by spacers that allow the regions of the
peptides to be presented to the receptor in different ways. The distance
between binding regions may be as little as 1 residue, or at least 2-20
residues, or up to at least 100 residues. Preferred spacers are 3, 6, 9, 12,
15, or 18 residues in length. For probing large binding sites or tandem
binding sites (e.g., Site 1 and Site 2 of IR), the binding regions may be
separated by a spacer of up to 20 to 30 amino acid residues. The
number of spacer residues when present will typically be at least 2 residues,
and often will be less than 20 residues.

The oligonucleotide library may have binding sequences which are
separated by spacers (e.g., linkers), and thus may be represented by the
formula: (NNK)y-(abc)n-(NNK)z, where N and K are as defined previously
(note that S as defined previously may be substituted for K), and y+z is
equal to about 5, 6, 7, 8, or more, a, b and c represent the same or different
nucleotides comprising a codon encoding spacer amino acids, n is up to
about 3, 6, 9, or 12 amino acids, or more. The spacer residues may be
somewhat flexible, comprising oligo-glycine, or oligo-glycine-glycine-serine,
for example, to provide the diversity domains of the library with the ability to
interact with sites in a large binding site relatively unconstrained by
attachment to the phage protein. Rigid spacers, such as, e.g., oligo-proline,
may also be inserted separately or in combination with other spacers,
including glycine spacers. It may be desired to have the IR or IGF-1R
binding sequences close to one another and use a spacer to orient the
binding sequences with respect to each other, such as by employing a turn
between the two sequences, as might be provided by a spacer of the
sequence glycine-proline-glycine, for example. To add stability to such a
turn, it may be desirable or necessary to add cysteine residues at either or
both ends of each variable region. The cysteine residues would then form
disulfide bridges to hold the variable regions together in a loop, and in this
fashion may also serve to mimic a cyclic peptide. Of course, those skilled in
the art will appreciate that various other types of covalent linkages for
cyclization may also be used.
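
As a rough illustration of the library formula above, the following Python
sketch builds one random member of (NNK)y-(abc)n-(NNK)z and counts the number
of distinct DNA sequences in the variable regions. It assumes the standard
IUPAC meanings N = A/C/G/T and K = G/T (the specification defines these
symbols earlier, outside this passage). The function names and the GGC
(glycine) spacer codon are hypothetical choices made only for illustration.

```python
import random

N = "ACGT"  # IUPAC N: any base (assumed standard meaning)
K = "GT"    # IUPAC K: G or T (assumed standard meaning)


def random_nnk_codons(count, rng):
    """Return `count` random NNK codons joined into one DNA string."""
    return "".join(
        rng.choice(N) + rng.choice(N) + rng.choice(K) for _ in range(count)
    )


def library_member(y, n, z, spacer_codon="GGC", rng=None):
    """Build one random member of (NNK)y-(abc)n-(NNK)z.

    spacer_codon plays the role of 'abc'; GGC (glycine) is a hypothetical
    choice that yields an oligo-glycine spacer.
    """
    rng = rng or random.Random()
    return (
        random_nnk_codons(y, rng)
        + spacer_codon * n
        + random_nnk_codons(z, rng)
    )


def nucleotide_diversity(y, z):
    """Distinct DNA sequences: 4 * 4 * 2 = 32 codons per NNK position."""
    return 32 ** (y + z)


oligo = library_member(y=3, n=6, z=3, rng=random.Random(0))
print(oligo, len(oligo))
print(nucleotide_diversity(3, 3))
```

The count returned by nucleotide_diversity is at the DNA level; the number of
distinct peptides is smaller because several NNK codons encode the same amino
acid.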

Spacer residues as described above may also be situated on either
or both ends of the IR or IGF-1R binding sequences. For instance, a cyclic
peptide may be designed without an intervening spacer, by having a
cysteine residue on both ends of the peptide. As described above, flexible
spacers, e.g., oligo-glycine, may facilitate interaction of the peptide with the
selected receptors. Alternatively, rigid spacers may allow the peptide to be
presented as if on the end of a rigid arm, where the number of residues,
e.g., proline residues, determines not only the length of the arm but also the
direction for the arm in which the peptide is oriented. Hydrophilic spacers,
made up of charged and/or uncharged hydrophilic amino acids (e.g., Thr,
His, Asn, Gln, Arg, Glu, Asp, Met, Lys, etc.), or hydrophobic spacers of
hydrophobic amino acids (e.g., Phe, Leu, Ile, Gly, Val, Ala, etc.) may be
used to present the peptides to receptor binding sites with a variety of local
environments.

Notably, some peptides, because of their size and/or sequence, may
cause severe defects in the infectivity of their carrier phage. This causes a
loss of phage from the population during reinfection and amplification
following each cycle of panning. To minimize problems associated with
defective infectivity, DNA prepared from the eluted phage can be
transformed into appropriate host cells, such as, e.g., E. coli, preferably by
electroporation (see, e.g., Dower et al., Nucl. Acids Res. 16:6127-6145), or
by well-known chemical means. The cells are cultivated for a period of time
sufficient for marker expression, and selection is applied as typically done
for DNA transformation. The colonies are amplified, and phage harvested
for affinity enrichment in accordance with established methods. Phage 
identified in the affinity enrichment may be re-amplified by infection into the 
host cells. The successful transformants are selected by growth in an 
appropriate antibiotic(s), e.g., tetracycline or ampicillin. This may be done
on solid or in liquid growth medium.

For growth on solid medium, the cells are grown at a high density
(about 10^8 to 10^9 transformants per m^2) on a large surface of, for example,
L-agar containing the selective antibiotic to form essentially a confluent
lawn. The cells and extruded phage are scraped from the surface and
phage are prepared for the first round of panning (see, e.g., Parmley and
Smith, 1988, Gene 73:305-318). For growth in liquid culture, cells may be
grown in L-broth and antibiotic through about 10 or more doublings. The
phage are harvested by standard procedures (see Sambrook et al., 1989,
Molecular Cloning, 2nd ed.). Growth in liquid culture may be more
convenient because of the size of the libraries, while growth on solid media
likely provides less chance of bias during the amplification process.

For affinity enrichment of desired clones, generally about 10^3 to 10^4
library equivalents (a library equivalent is one of each recombinant; 10^4
equivalents of a library of 10^9 members is 10^9 x 10^4 = 10^13 phage), but
typically at least 10^2 library equivalents, up to about 10^5 to 10^6, are
incubated with a receptor (or portion thereof) to which the desired peptide is
sought. The receptor is in one of several forms appropriate for affinity
enrichment schemes. In one example the receptor is immobilized on a
surface or particle, and the library of phage bearing peptides is then panned
on the immobilized receptor generally according to procedures known in the
art. In an alternate scheme, a receptor is attached to a recognizable ligand
(which may be attached via a tether). A specific example of such a ligand is
biotin. The receptor, so modified, is incubated with the library of phage and
binding occurs with both reactants in solution. The resulting complexes are
then bound to streptavidin (or avidin) through the biotin moiety. The
streptavidin may be immobilized on a surface such as a plastic plate or on
particles, in which case the complexes
(phage/peptide/receptor/biotin/streptavidin) are physically retained; or the
streptavidin may be labeled, with a fluorophor, for example, to tag the active
phage/peptide for detection and/or isolation by sorting procedures, e.g., on a
fluorescence-activated cell sorter.
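
The library-equivalent arithmetic given at the start of this paragraph can be
checked with a few lines of Python. This is only a sketch of the stated
calculation; the function name phage_needed is a hypothetical label, and the
numbers are the ones quoted in the text.

```python
def phage_needed(library_size, equivalents):
    """Total phage particles for a given number of library equivalents.

    One library equivalent is one copy of each recombinant, so the total
    is simply library size multiplied by the number of equivalents.
    """
    return library_size * equivalents


# Example from the text: 10^4 equivalents of a 10^9-member library.
total = phage_needed(10**9, 10**4)
print(total == 10**13)

# Range quoted in the text for the same library: 10^2 up to 10^6 equivalents.
low, high = phage_needed(10**9, 10**2), phage_needed(10**9, 10**6)
print(low, high)
```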

Phage that associate with IR or IGF-1R via non-specific interactions
are removed by washing. The degree and stringency of washing required
will be determined for each receptor/peptide of interest. A certain degree of
control can be exerted over the binding characteristics of the peptides
recovered by adjusting the conditions of the binding incubation and the
subsequent washing. The temperature, pH, ionic strength, divalent cation
concentration, and the volume and duration of the washing will select for
peptides within particular ranges of affinity for the receptor. Selection based
on slow dissociation rate, which is usually predictive of high affinity, is the
most practical route. This may be done either by continued incubation in the
presence of a saturating amount of free ligand, or by increasing the volume,
number, and length of the washes. In each case, the rebinding of
dissociated peptide-phage is prevented, and with increasing time,
peptide-phage of higher and higher affinity are recovered. Additional
modifications of the binding and washing procedures may be applied to find
peptides that bind receptors under special conditions. Once a peptide
sequence that imparts some affinity and specificity for the receptor molecule
is known, the diversity around this binding motif may be embellished. For
instance, variable peptide regions may be placed on one or both ends of the
identified sequence. The known sequence may be identified from the
literature, or may be derived from early rounds of panning in the context of
the present invention.
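
The reasoning behind selection by slow dissociation can be sketched with a
simple first-order model. This is an assumption made for illustration, not a
model stated in the specification: if rebinding is prevented, the fraction of
a clone still bound after wash time t is taken to be exp(-k_off * t). The
rate constants below are hypothetical values.

```python
import math


def fraction_bound(k_off, t):
    """Fraction of phage still bound after time t (seconds), assuming
    first-order dissociation with no rebinding."""
    return math.exp(-k_off * t)


def enrichment(k_off_slow, k_off_fast, t):
    """Ratio of retained slow-dissociating to fast-dissociating clones."""
    return fraction_bound(k_off_slow, t) / fraction_bound(k_off_fast, t)


# Hypothetical off-rates (per second) for a tight and a weak binder.
K_SLOW, K_FAST = 1e-4, 1e-2

for wash_seconds in (60, 300, 600):
    print(wash_seconds, enrichment(K_SLOW, K_FAST, wash_seconds))
```

Under this model the enrichment ratio grows with wash time, which matches the
statement that longer washes recover peptide-phage of progressively higher
affinity.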






G. Screening Assays 

In another embodiment of this invention, screening assays to identify
pharmacologically active ligands at IR and/or IGF-1R are provided. Ligands
may encompass numerous chemical classes, though typically they are
organic molecules, preferably small organic compounds having a molecular
weight of more than 50 and less than about 2,500 daltons. Such ligands
can comprise functional groups necessary for structural interaction with
proteins, particularly hydrogen bonding, and typically include at least an
amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the
functional chemical groups. Ligands often comprise cyclical carbon or
heterocyclic structures and/or aromatic or polyaromatic structures
substituted with one or more of the above functional groups. Ligands can
also comprise biomolecules including peptides, saccharides, fatty acids,
steroids, purines, pyrimidines, derivatives, structural analogs, or
combinations thereof.

Ligands may include, for example, 1) peptides such as soluble
peptides, including Ig-tailed fusion peptides and members of random peptide
libraries (see, e.g., Lam et al., 1991, Nature 354:82-84; Houghten et al.,
1991, Nature 354:84-86) and combinatorial chemistry-derived molecular
libraries made of D- and/or L-configuration amino acids; 2) phosphopeptides
(e.g., members of random and partially degenerate, directed
phosphopeptide libraries, see, e.g., Songyang et al., 1993, Cell 72:767-778);
3) antibodies (e.g., polyclonal, monoclonal, humanized, anti-idiotypic,
chimeric, and single chain antibodies as well as Fab, F(ab')2, Fab
expression library fragments, and epitope-binding fragments of antibodies);
and 4) small organic and inorganic molecules.

Ligands can be obtained from a wide variety of sources including
libraries of synthetic or natural compounds. Synthetic compound libraries
are commercially available from, for example, Maybridge Chemical Co.
(Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates
(Merrimack, NH), and Microsource (New Milford, CT). A rare chemical
library is available from Aldrich Chemical Company, Inc. (Milwaukee, WI).
Natural compound libraries comprising bacterial, fungal, plant or animal
extracts are available from, for example, Pan Laboratories (Bothell, WA). In
addition, numerous means are available for random and directed synthesis
of a wide variety of organic compounds and biomolecules, including
expression of randomized oligonucleotides.

Alternatively, libraries of natural compounds in the form of bacterial,
fungal, plant and animal extracts can be readily produced. Methods for the
synthesis of molecular libraries are readily available (see, e.g., DeWitt et al.,
1993, Proc. Natl. Acad. Sci. USA 90:6909; Erb et al., 1994, Proc. Natl. Acad.
Sci. USA 91:11422; Zuckermann et al., 1994, J. Med. Chem. 37:2678; Cho
et al., 1993, Science 261:1303; Carell et al., 1994, Angew. Chem. Int. Ed.
Engl. 33:2059; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 33:2061; and
in Gallop et al., 1994, J. Med. Chem. 37:1233). In addition, natural or
synthetic compound libraries and compounds can be readily modified
through conventional chemical, physical and biochemical means (see, e.g.,
Blondelle et al., 1996, Trends in Biotech. 14:60), and may be used to
produce combinatorial libraries. In another approach, previously identified
pharmacological agents can be subjected to directed or random chemical
modifications, such as acylation, alkylation, esterification, amidification, and
the analogs can be screened for IR-modulating activity.

Numerous methods for producing combinatorial libraries are known in
the art, including those involving biological libraries; spatially addressable
parallel solid phase or solution phase libraries; synthetic library methods
requiring deconvolution; the 'one-bead one-compound' library method; and
synthetic library methods using affinity chromatography selection. The
biological library approach is limited to polypeptide or peptide libraries, while
the other four approaches are applicable to polypeptide, peptide,
non-peptide oligomer, or small molecule libraries of compounds (K. S. Lam,
1997, Anticancer Drug Des. 12:145).

Libraries may be screened in solution by methods generally known in
the art for determining whether ligands competitively bind at a common
binding site. Such methods may include screening libraries in solution
(e.g., Houghten, 1992, Biotechniques 13:412-421), or on beads (Lam, 1991,
Nature 354:82-84), chips (Fodor, 1993, Nature 364:555-556), bacteria or
spores (Ladner U.S. Pat. No. 5,223,409), plasmids (Cull et al., 1992, Proc.
Natl. Acad. Sci. USA 89:1865-1869), or on phage (Scott and Smith, 1990,
Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et al.,
1990, Proc. Natl. Acad. Sci. USA 87:6378-6382; Felici, 1991, J. Mol. Biol.
222:301-310; Ladner, supra).

Where the screening assay is a binding assay, IR, or one of the
IR-binding peptides disclosed herein, may be joined to a label, where the
label can directly or indirectly provide a detectable signal. Various labels
include radioisotopes, fluorescent molecules, chemiluminescent molecules,
enzymes, specific binding molecules, particles, e.g., magnetic particles, and
the like. Specific binding molecules include pairs, such as biotin and
streptavidin, digoxin and antidigoxin, etc. For the specific binding members,
the complementary member would normally be labeled with a molecule that
provides for detection, in accordance with known procedures.

A variety of other reagents may be included in the screening assay.
These include reagents like salts, neutral proteins, e.g., albumin, detergents,
etc., which are used to facilitate optimal protein-protein binding and/or
reduce non-specific or background interactions. Reagents that improve the
efficiency of the assay, such as protease inhibitors, nuclease inhibitors,
anti-microbial agents, etc., may be used. The components are added in any
order that produces the requisite binding. Incubations are performed at any
temperature that facilitates optimal activity, typically between 4° and 40°C.
Incubation periods are selected for optimum activity, but may also be
optimized to facilitate rapid high-throughput screening. Normally, between
0.1 and 1 hr will be sufficient. In general, a plurality of assay mixtures is run
in parallel with different test agent concentrations to obtain a differential
response to these concentrations. Typically, one of these concentrations
serves as a negative control, i.e., at zero concentration or below the level of
detection.

The screening assays provided in accordance with this invention are
based on those disclosed in International application WO 96/04557, which is
incorporated herein in its entirety. Briefly, WO 96/04557 discloses the use
of reporter peptides that bind to active sites on targets and possess agonist
or antagonist activity at the target. These reporters are identified from
recombinant libraries and are either peptides with random amino acid
sequences or variable antibody regions with at least one CDR region that
has been randomized (rVab). The reporter peptides may be expressed in
cell recombinant expression systems, such as for example in E. coli, or by
phage display (see WO 96/04557 and Kay et al., 1996, Mol. Divers.
1(2):139-40, both of which are incorporated herein by reference). The
reporters identified from the libraries may then be used in accordance with
this invention either as therapeutics themselves, or in competition binding
assays to screen for other molecules, preferably small, active molecules,
which possess similar properties to the reporters and may be developed as
drug candidates to provide agonist or antagonist activity. Preferably, these
small organic molecules are orally active.

The basic format of an in vitro competitive receptor binding assay as
the basis of a heterogeneous screen for small organic molecular
replacements for insulin may be as follows: occupation of the active site of
IR is quantified by time-resolved fluorometric detection (TRFD) with
streptavidin-labeled europium (saEu) complexed to biotinylated peptides
(bP). In this assay, saEu forms a ternary complex with bP and IR (i.e.,
IR:bP:saEu complex). The TRFD assay format is well established,
sensitive, and quantitative (Tompkins et al., 1993, J. Immunol. Methods
163:209-216). The assay can use a single-chain antibody or a biotinylated
peptide. Furthermore, both assay formats faithfully report the competition of
the biotinylated ligands binding to the active site of IR by insulin.

In these assays, soluble IR is coated on the surface of microtiter
wells, blocked by a solution of 0.5% bovine serum albumin (BSA) and 2%
non-fat milk in PBS, and then incubated with biotinylated peptide or rVab.
Unbound bP is then washed away and saEu is added to complex with
receptor-bound bP. Upon addition of the acidic enhancement solution, the
bound europium is released as free Eu3+, which rapidly forms a highly
fluorescent and stable complex with components of the enhancement
solution. The IR:bP bound saEu is then converted into its highly fluorescent
state and detected by a detector such as Wallac Victor II (EG&G Wallac,
Inc.).
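
A competition readout of this kind is commonly reduced to percent inhibition
and an approximate IC50. The Python sketch below shows one way to do that
under stated assumptions: the fluorescence counts, the function names, and
the log-linear interpolation between bracketing concentrations are all
illustrative choices, not details taken from the specification.

```python
import math


def percent_inhibition(signal, max_signal, background):
    """Percent loss of specific signal relative to the no-competitor well."""
    return 100.0 * (1.0 - (signal - background) / (max_signal - background))


def ic50_by_interpolation(concs, inhibitions):
    """Estimate IC50 by log-linear interpolation between the two tested
    concentrations that bracket 50% inhibition.

    concs must be positive and ascending (exclude the zero-concentration
    control). Returns None if 50% is never bracketed.
    """
    points = list(zip(concs, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi:
            if i_hi == i_lo:
                return c_lo
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10 ** log_c
    return None


# Hypothetical counts: background 1000, uninhibited signal 10000.
concs = [1e-9, 1e-8, 1e-7]            # competitor concentration, mol/L
counts = [7750, 5500, 3250]           # hypothetical TRFD counts
inhib = [percent_inhibition(c, 10000, 1000) for c in counts]
print(inhib)
print(ic50_by_interpolation(concs, inhib))
```

A full analysis would normally fit a sigmoidal dose-response curve; the
interpolation here is only a quick estimate.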

Phage display libraries can also be screened for ligands that bind to
IR or IGF-1R, as described above. Details of the construction and analyses
of these libraries, as well as the basic procedures for biopanning and
selection of binders, have been published (see, e.g., WO 96/04557;
Mandecki et al., 1997, Display Technologies - Novel Targets and
Strategies, P. Guttry (ed), International Business Communications, Inc.
Southborough, MA, pp. 231-254; Ravera et al., 1998, Oncogene
16:1993-1999; Scott and Smith, 1990, Science 249:386-390; Grihalde et al.,
1995, Gene 166:187-195; Chen et al., 1996, Proc. Natl. Acad. Sci. USA
93:1997-2001; Kay et al., 1993, Gene 128:59-65; Carcamo et al., 1998, Proc.
Natl. Acad. Sci. USA 95:11146-11151; Hoogenboom, 1997, Trends Biotechnol.
15:62-70; Rader and Barbas, 1997, Curr. Opin. Biotechnol. 8:503-508; all of
which are incorporated herein by reference).

The designing of mimetics to a known pharmaceutically active
compound is a known approach to the development of pharmaceuticals
based on a "lead" compound. This might be desirable where the active
compound is difficult or expensive to synthesize or where it is unsuitable for
a particular method of administration, e.g., peptides are generally unsuitable
active agents for oral compositions as they tend to be quickly degraded by
proteases in the alimentary canal. Mimetic design, synthesis, and testing
are generally used to avoid large-scale screening of molecules for a target
property.

There are several steps commonly taken in the design of a mimetic
from a compound having a given target property. First, the particular parts
of the compound that are critical and/or important in determining the target
property are determined. In the case of a peptide, this can be done by
systematically varying the amino acid residues in the peptide (e.g., by
substituting each residue in turn). These parts or residues constituting the
active region of the compound are known as its "pharmacophore".
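
Substituting each residue in turn can be sketched as a simple scan over the
peptide. The example below uses alanine as the replacement residue, which is
a common convention but an assumption here (the text does not name the
replacement), and the function name substitution_scan is hypothetical. The
input sequence is the D117/H2C peptide without its terminal lysine, as listed
elsewhere in this specification.

```python
def substitution_scan(peptide, replacement="A"):
    """Return (position, original_residue, variant) for each single-residue
    substitution. Positions are 1-based. Residues already equal to the
    replacement are skipped because the variant would be unchanged."""
    variants = []
    for i, residue in enumerate(peptide):
        if residue == replacement:
            continue
        variant = peptide[:i] + replacement + peptide[i + 1:]
        variants.append((i + 1, residue, variant))
    return variants


scan = substitution_scan("FHENFYDWFVRQVS")
for position, original, variant in scan:
    print(f"{original}{position}A  {variant}")
```

Each variant would then be tested for binding or activity; positions where
substitution abolishes the target property are candidates for the
pharmacophore.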

Once the pharmacophore has been found, its structure is modeled
according to its physical properties (e.g., stereochemistry, bonding, size,
and/or charge), using data from a range of sources (e.g., spectroscopic
techniques, X-ray diffraction data, and NMR). Computational analysis,
similarity mapping (which models the charge and/or volume of a
pharmacophore, rather than the bonding between atoms), and other
techniques can be used in this modeling process.

In a variant of this approach, the three dimensional structure of the
ligand and its binding partner are modeled. This can be especially useful
where the ligand and/or binding partner change conformation on binding,
allowing the model to take account of this in the design of the mimetic.

A template molecule is then selected, and chemical groups that
mimic the pharmacophore can be grafted onto the template. The template
molecule and the chemical groups grafted on to it can conveniently be
selected so that the mimetic is easy to synthesize, is likely to be
pharmacologically acceptable, does not degrade in vivo, and retains the
biological activity of the lead compound. The mimetics found are then
screened to ascertain the extent to which they exhibit the target property, or
to what extent they inhibit it. Further optimization or modification can then
be carried out to arrive at one or more final mimetics for in vivo or clinical
testing.

This invention provides specific IR and IGF-1R amino acid sequences
that function as either agonists or antagonists at IR and/or IGF-1R.
Additional sequences may be obtained in accordance with the procedures
described herein.

H. Use of the Peptides Provided by this Invention

The IR and IGF-1R agonist and antagonist peptides provided by this
invention are useful as lead compounds for identifying other more potent or
selective therapeutics, assay reagents for identifying other useful ligands by,
for example, competition screening assays, as research tools for further
analysis of IR and IGF-1R, and as potential therapeutics in pharmaceutical
compositions. In one embodiment, one or more of the disclosed peptides
can be provided as components in a kit for identifying other ligands (e.g.,
small, organic molecules) that bind to IR or IGF-1R. Such kits may also
comprise IR or IGF-1R, or functional fragments thereof. The peptide and
receptor components of the kit may be labeled (e.g., by radioisotopes,
fluorescent molecules, chemiluminescent molecules, enzymes or other
labels), or may be unlabeled and labeling reagents may be provided. The
kits may also contain peripheral reagents such as buffers, stabilizers, etc.
Instructions for use can also be provided.

In another embodiment, the peptide sequences provided by this
invention can be used to design secondary peptide libraries, which are
derived from the peptide sequences, and include members that bind to Site
1 and/or Site 2 of IR or IGF-1R. Such libraries can be used to identify
sequence variants that increase or modulate the binding and/or activity of
the original peptide at IR or IGF-1R, as described in the related applications
of Beasley et al., International Application PCT/US00/08528, filed March 29,
2000, and Beasley et al., U.S. Application Serial No. 09/538,038, filed March
29, 2000, in accordance with well-established techniques.

IR agonist amino acid sequences provided by this invention are
useful as insulin analogs and may therefore be developed as treatments for
diabetes or other diseases associated with a decreased response or
production of insulin. For use as an insulin supplement or replacement,
non-limiting examples of amino acid sequences include D117/H2C:
FHENFYDWFVRQVSK (SEQ ID NO:1780); D117/H2C minus terminal
lysine: FHENFYDWFVRQVS (SEQ ID NO:1557); D118:
DYKDFYDAIQLVRSARAGGTRDKK (SEQ ID NO:1781); D118 minus
FLAG® tag and terminal lysines: FYDAIQLVRSARAGGTRD (SEQ ID
NO:1782); D119: KDRAFYNGLRDLVGAVYGAWDKK (SEQ ID NO:1733);
D119 minus terminal lysines: KDRAFYNGLRDLVGAVYGAWD (residues
1-21 of SEQ ID NO:1733); D116/JBA5: DYKDLCQSWGVRIGWLAGLCPKK
(SEQ ID NO:1541); D116/JBA5 minus FLAG® tag and terminal lysines:
LCQSWGVRIGWLAGLCP (SEQ ID NO:1542); D113/H2:
DYKDVTFTSAVFHENFYDWFVRQVSKK (SEQ ID NO:1783); D113/H2
minus FLAG® tag and terminal lysines: VTFTSAVFHENFYDWFVRQVS
(SEQ ID NO:1784); and S175: GRVDWLQRNANFYDWFVAELG (SEQ ID
NO:1560). Preferred peptide dimer sequences are represented by S325,
S332, S333, S335, S337, S353, S374-S376, S378, S379, S381, S414,
S415, and S418 (see Table 7). Other preferred dimer sequences are
represented by S455, S457, S458, S467, S468, S471, S499, S510, S518,
S519, and S520 sequences (see Table 7). Especially preferred is the S519
dimer sequence, which shows in vitro and in vivo activity comparable to
insulin (see Figures 31A-C, 32A-B, and 33), S557 (see, e.g., Figure 55), and
S597 (see, e.g., Figures 54-56).
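
The relationship between the tagged and untagged sequences listed above can
be checked mechanically. The Python sketch below assumes, based only on the
sequences in this paragraph, that the residues removed with the "FLAG® tag"
are the N-terminal DYKD and that the terminal lysines are the trailing K
residues; the function name core_sequence is hypothetical.

```python
TAG_PREFIX = "DYKD"  # N-terminal residues removed with the tag in the text


def core_sequence(peptide, tag=TAG_PREFIX):
    """Remove the N-terminal tag residues (if present) and any C-terminal
    lysines, leaving the core binding sequence."""
    if peptide.startswith(tag):
        peptide = peptide[len(tag):]
    return peptide.rstrip("K")


# (full sequence, expected core) pairs copied from the paragraph above.
PAIRS = {
    "D117/H2C": ("FHENFYDWFVRQVSK", "FHENFYDWFVRQVS"),
    "D118": ("DYKDFYDAIQLVRSARAGGTRDKK", "FYDAIQLVRSARAGGTRD"),
    "D119": ("KDRAFYNGLRDLVGAVYGAWDKK", "KDRAFYNGLRDLVGAVYGAWD"),
    "D116/JBA5": ("DYKDLCQSWGVRIGWLAGLCPKK", "LCQSWGVRIGWLAGLCP"),
    "D113/H2": ("DYKDVTFTSAVFHENFYDWFVRQVSKK", "VTFTSAVFHENFYDWFVRQVS"),
}

for name, (full, expected) in PAIRS.items():
    print(name, core_sequence(full) == expected)
```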

IGF-1R antagonist amino acid sequences provided by this invention
are useful as treatments for cancers, including, but not limited to, breast,
prostate, colorectal, and ovarian cancers. Human and breast cancers are
responsible for over 40,000 deaths per year, as present treatments such as
surgery, chemotherapy, radiation therapy, and immunotherapy show limited
success. The IGF-1R antagonist amino acid sequences disclosed herein
are also useful for the treatment or prevention of diabetic retinopathy.
Recent reports have shown that a previously identified IGF-1R antagonist
can suppress retinal neovascularization, which causes diabetic retinopathy
(Smith et al., 1999, Nat. Med. 5:1390-1395).

IGF-1R agonist amino acid sequences provided by this invention are
useful for development as treatments for neurological disorders, including
stroke and diabetic neuropathy. Reports of several different groups
implicate IGF-1R in the reduction of global brain ischemia, and support the
use of IGF-1 for the treatment of diabetic neuropathy (reviewed in Auer et
al., 1998, Neurology 51:S39-S43; Apfel, 1999, Am. J. Med. 107:34S-42S).

I. Modification of Peptides 

The peptides of the invention may be subjected to one or more
modifications known in the art, which may be useful for manipulating storage
stability, pharmacokinetics, and/or any aspect of the bioactivity of the
peptide, such as, e.g., potency, selectivity, and drug interaction. Chemical
modification to which the peptides may be subjected includes, without
limitation, the conjugation to a peptide of one or more of polyethylene glycol
(PEG), monomethoxy-polyethylene glycol, dextran, poly-(N-vinyl
pyrrolidone) polyethylene glycol, propylene glycol homopolymers, a
polypropylene oxide/ethylene oxide co-polymer, polypropylene glycol,
polyoxyethylated polyols (e.g., glycerol) and polyvinyl alcohol, colominic
acids or other carbohydrate based polymers, polymers of amino acids, and
biotin derivatives. PEG conjugation of proteins at Cys residues is disclosed,
e.g., in Goodson, R. J. & Katre, N. V. (1990) Bio/Technology 8, 343 and
Kogan, T. P. (1992) Synthetic Comm. 22, 2417.

Other useful modifications include, without limitation, acylation, using
methods and compositions such as described in, e.g., U.S. Patent Serial No.
6,251,856, and WO 00/55119.



J. Therapeutic Administration 

The peptides of the present invention may be administered
individually or in combination with other pharmacologically active agents. It
will be understood that such combination therapy encompasses different
therapeutic regimens, including, without limitation, administration of multiple
agents together in a single dosage form or in distinct, individual dosage
forms. If the agents are present in different dosage forms, administration
may be simultaneous or near-simultaneous or may follow any
predetermined regimen that encompasses administration of the different
agents.

For example, when used to treat diabetes or other diseases or
syndromes associated with a decreased response or production of insulin,
hyperlipidemia, obesity, appetite-related syndromes, and the like, the
peptides of the invention may be advantageously administered in a
combination treatment regimen with one or more agents, including, without
limitation, insulin, insulin analogues, insulin derivatives, glucagon-like
peptide-1 or -2 (GLP-1, GLP-2), and derivatives or analogues of GLP-1 or
GLP-2 (such as are disclosed, e.g., in WO 00/55119). It will be understood
that an "analogue" of insulin, GLP-1, or GLP-2 as used herein refers to a
peptide containing one or more amino acid substitutions relative to the native
sequence of insulin, GLP-1, or GLP-2, as applicable; and "derivative" of
insulin, GLP-1, or GLP-2 as used herein refers to a native or analogue
insulin, GLP-1, or GLP-2 peptide that has undergone one or more additional
chemical modifications of the amino acid sequence, in particular relative to
the natural sequence. Insulin derivatives and analogues are disclosed, e.g.,
in U.S. Patent Serial No. 5,656,722, 5,750,497, 6,251,856, and 6,268,335.
In some embodiments, the combination agent is one of
LysB29(Nε-myristoyl)des(B30) human insulin,
LysB29(Nε-tetradecanoyl)des(B30) human insulin, and
B29-Nε-(N-lithocholyl-γ-glutamyl)-des(B30) human insulin. Also
suitable for combination therapy are non-peptide antihyperglycemic agents,
antihyperlipidemic agents, and the like such as those well-known in the art.

In one embodiment, the invention encompasses methods of treating
diabetes or related syndromes comprising administering a first amount of
peptide S597 or peptide S557 and a second amount of a long-acting insulin
analogue, such as, e.g., LysB29(Nε-myristoyl)des(B30) human insulin,
LysB29(Nε-tetradecanoyl)des(B30) human insulin, or
B29-Nε-(N-lithocholyl-γ-glutamyl)-des(B30) human insulin, wherein the first
and second amounts together are effective for treating the syndrome. As
used herein, a long-acting insulin analogue is one that exhibits a protracted
profile of action relative to native human insulin, as disclosed, e.g., in U.S.
Patent Serial No. 6,451,970.

The peptides of the present invention may be administered
individually or in combination with other pharmacologically active agents. It
will be understood that such combination therapy encompasses different
therapeutic regimens, including, without limitation, administration of multiple
agents together in a single dosage form or in distinct, individual dosage
forms. If the agents are present in different dosage forms, administration
may be simultaneous or near-simultaneous or may follow any
predetermined regimen that encompasses administration of the different
agents.

K. Methods of Administration 

The amino acid sequences of this invention may be administered as
pharmaceutical compositions comprising standard carriers known in the art
for delivering proteins and peptides, or by gene therapy. Preferably, a
pharmaceutical composition includes, in admixture, a pharmaceutically (i.e.,
physiologically) acceptable carrier, excipient, or diluent, and one or more of
an IR or IGF-1R agonist or antagonist peptide, as an active ingredient. The
preparation of pharmaceutical compositions that contain peptides as active
ingredients is well understood in the art. Typically, such compositions are
prepared as injectables, either as liquid solutions or suspensions; however,
solid forms suitable for solution in, or suspension in, liquid prior to injection
can also be prepared. The preparation can also be emulsified. The active
therapeutic ingredient is often mixed with excipients that are
pharmaceutically (i.e., physiologically) acceptable and compatible with the
active ingredient. Suitable excipients are, for example, water, saline,
dextrose, glycerol, ethanol, or the like, and combinations thereof. In addition,
if desired, the composition can contain minor amounts of auxiliary
substances, such as wetting or emulsifying agents or pH-buffering agents,
which enhance the effectiveness of the active ingredient.

An IR or IGF-1R agonist or antagonist peptide can be formulated into
a pharmaceutical composition as neutralized physiologically acceptable salt
forms. Suitable salts include the acid addition salts (i.e., those formed with
the free amino groups of the peptide molecule), which are formed with
inorganic acids such as, for example, hydrochloric or phosphoric acids, or
such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts
formed from the free carboxyl groups can also be derived from inorganic
bases such as, for example, sodium, potassium, ammonium, calcium, or
ferric hydroxides, and such organic bases as isopropylamine,
trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.

The pharmaceutical compositions can be administered systemically
by oral or parenteral routes. Non-limiting parenteral routes of administration
include subcutaneous, intramuscular, intraperitoneal, intravenous,
transdermal, inhalation, intranasal, intra-arterial, intrathecal, enteral,
sublingual, or rectal. Due to the labile nature of the amino acid sequences,
parenteral administration is preferred. Preferred modes of administration
include aerosols for nasal or bronchial absorption; suspensions for
intravenous, intramuscular, intrasternal, or subcutaneous injection; and
compounds for oral administration.

Intravenous administration, for example, can be performed by
injection of a unit dose. The term "unit dose" when used in reference to a
pharmaceutical composition of the present invention refers to physically
discrete units suitable as unitary dosage for humans, each unit containing a
predetermined quantity of active material calculated to produce the desired
therapeutic effect in association with the required diluent, i.e., the liquid
used to dilute a concentrated or pure substance (either liquid or solid) to
the correct concentration for use. For injectable
administration, the composition is in sterile solution or suspension or may be
emulsified in pharmaceutically and physiologically acceptable aqueous or
oleaginous vehicles, which may contain preservatives, stabilizers, and
material for rendering the solution or suspension isotonic with body fluids
(i.e., blood) of the recipient.

Excipients suitable for use are water, phosphate buffered saline (pH
7.4), 0.15 M aqueous sodium chloride solution, dextrose, glycerol, dilute
ethanol, and the like, and mixtures thereof. Illustrative stabilizers are
polyethylene glycol, proteins, saccharides, amino acids, inorganic acids, and
organic acids, which may be used either on their own or as admixtures. The
amounts or quantities, as well as routes of administration, used are
determined on an individual basis, and correspond to the amounts used in
similar types of applications or indications known to those of skill in the art.

Pharmaceutical compositions are administered in a manner
compatible with the dosage formulation, and in a therapeutically effective
amount. The quantity to be administered depends on the subject to be
treated, capacity of the subject's immune system to utilize the active
ingredient, and degree of modulation of IR or IGF-1R activity desired.
Precise amounts of active ingredient required to be administered depend on
the judgment of the practitioner and are specific for each individual.
However, suitable dosages may range from about 10 to 200 nmol active
peptide per kilogram body weight of individual per day and depend on the
route of administration. Suitable regimens for initial administration and
booster shots are also variable, but are typified by an initial administration
followed by repeated doses, at intervals of one or more hours, by a
subsequent injection or other administration. Alternatively, continuous
intravenous infusions sufficient to maintain picomolar to nanomolar
concentrations (e.g., approximately 1 pM to approximately 10 nM) in the
blood are contemplated. An exemplary
formulation comprises the IR or IGF-1R agonist or antagonist peptide in a
mixture with sodium bisulfite USP (3.2 mg/ml); disodium edetate USP (0.1
mg/ml); and water for injection q.s.a.d. (1 ml).
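
As a rough illustration of the dosage range stated above, the following sketch converts a molar dose (nmol per kilogram per day) into a daily mass dose. The peptide molecular weight (4000 g/mol) and the body weight (70 kg) are hypothetical values chosen only for the arithmetic; neither is taken from this specification.

```python
def daily_dose_mg(dose_nmol_per_kg, body_weight_kg, peptide_mw_g_per_mol):
    """Convert a molar dose in nmol/kg/day to a mass dose in mg/day."""
    total_nmol = dose_nmol_per_kg * body_weight_kg
    # nmol multiplied by g/mol gives nanograms; 1 mg = 1e6 ng.
    return total_nmol * peptide_mw_g_per_mol / 1e6


if __name__ == "__main__":
    # Hypothetical subject (70 kg) and peptide (4000 g/mol), for illustration only.
    for dose in (10, 200):
        mg = daily_dose_mg(dose, body_weight_kg=70, peptide_mw_g_per_mol=4000)
        print(f"{dose} nmol/kg/day corresponds to {mg:.1f} mg/day")
```

Under these assumed values, the two ends of the range differ by a factor of twenty in mass, exactly as they do in molar terms; the actual mass dose depends on the molecular weight of the particular peptide.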

Further guidance in preparing pharmaceutical formulations can be
found in, e.g., Gilman et al. (eds), 1990, Goodman and Gilman's: The
Pharmacological Basis of Therapeutics, 8th ed., Pergamon Press;
Remington's Pharmaceutical Sciences, 17th ed., 1990, Mack Publishing
Co., Easton, PA; Avis et al. (eds), 1993, Pharmaceutical Dosage Forms:
Parenteral Medications, Dekker, New York; and Lieberman et al. (eds), 1990,
Pharmaceutical Dosage Forms: Disperse Systems, Dekker, New York.

The present invention further contemplates compositions comprising 
an IR or IGF-1R agonist or antagonist peptide, and a physiologically 
acceptable carrier, excipient, or diluent as described in detail herein. 

The constructs as described herein may also be used in gene
transfer and gene therapy methods to allow the expression of one or more
amino acid sequences of the present invention. The amino acid sequences
of the present invention can be used for gene therapy and thereby provide
an alternative method of treating diabetes which does not rely on the
administration or expression of insulin. Expressing insulin for use in gene
therapy requires the expression of a precursor product, which must then
undergo processing including cleavage and disulfide bond formation to form
the active product. The amino acid sequences of this invention, which
possess activity, are relatively small, and thus do not require the complex
processing steps to become active. Accordingly, these sequences provide a
more suitable product for gene therapy.

Gene transfer systems known in the art may be useful in the practice
of the gene therapy methods of the present invention. These include viral
and non-viral transfer methods. A number of viruses have been used as
gene transfer vectors, including polyoma, i.e., SV40 (Madzak et al., 1992, J.
Gen. Virol., 73:1533-1536), adenovirus (Berkner, 1992, Curr. Top. Microbiol.
Immunol., 158:39-6; Berkner et al., 1988, BioTechniques, 6:616-629;
Gorziglia et al., 1992, J. Virol., 66:4407-4412; Quantin et al., 1992, Proc.
Natl. Acad. Sci. USA, 89:2581-2584; Rosenfeld et al., 1992, Cell, 68:143-155;
Wilkinson et al., 1992, Nucl. Acids Res., 20:2233-2239;
Stratford-Perricaudet et al., 1990, Hum. Gene Ther., 1:241-256), vaccinia virus
(Mackett et al., 1992, Biotechnology, 24:495-499), adeno-associated virus
(Muzyczka, 1992, Curr. Top. Microbiol. Immunol., 158:91-123; Ohi et al.,
1990, Gene, 89:279-282), herpes viruses including HSV and EBV
(Margolskee, 1992, Curr. Top. Microbiol. Immunol., 158:67-90; Johnson et
al., 1992, J. Virol., 66:2952-2965; Fink et al., 1992, Hum. Gene Ther.,
3:11-19; Breakfield et al., 1987, Mol. Neurobiol., 1:337-371; Fresse et al.,
1990, Biochem. Pharmacol., 40:2189-2199), and retroviruses of avian
(Brandyopadhyay et al., 1984, Mol. Cell Biol., 4:749-754; Petropouplos et
al., 1992, J. Virol., 66:3391-3397), murine (Miller, 1992, Curr. Top. Microbiol.
Immunol., 158:1-24; Miller et al., 1985, Mol. Cell Biol., 5:431-437; Sorge et
al., 1984, Mol. Cell Biol., 4:1730-1737; Mann et al., 1985, J. Virol.,
54:401-407), and human origin (Page et al., 1990, J. Virol., 64:5370-5276;
Buchschalcher et al., 1992, J. Virol., 66:2731-2739). Most human gene
therapy protocols have been based on disabled murine retroviruses.

Non-viral gene transfer methods known in the art include chemical
techniques such as calcium phosphate coprecipitation (Graham et al.,
1973, Virology, 52:456-467; Pellicer et al., 1980, Science, 209:1414-1422),
mechanical techniques, for example microinjection (Anderson et
al., 1980, Proc. Natl. Acad. Sci. USA, 77:5399-5403; Gordon et al., 1980,
Proc. Natl. Acad. Sci. USA, 77:7380-7384; Brinster et al., 1981, Cell,
27:223-231; Constants et al., 1981, Nature, 294:92-94), membrane
fusion-mediated transfer via liposomes (Felgner et al., 1987, Proc. Natl.
Acad. Sci. USA, 84:7413-7417; Wang et al., 1989, Biochemistry,
28:9508-9514; Kaneda et al., 1989, J. Biol. Chem., 264:12126-12129; Stewart
et al., 1992, Hum. Gene Ther., 3:267-275; Nabel et al., 1990, Science,
249:1285-1288; Lim et al., 1992, Circulation, 83:2007-2011; U.S. Patent
Nos. 5,283,185 and 5,795,587), and direct DNA uptake and
receptor-mediated DNA transfer (Wolff et al., 1990, Science, 247:1465-1468;
Wu et al., 1991, BioTechniques, 11:474-485; Zenke et al., 1990, Proc. Natl.
Acad. Sci. USA, 87:3655-3659; Wu et al., 1989, J. Biol. Chem.,
264:16985-16987; Wolff et al., 1991, BioTechniques, 11:474-485; Wagner
et al., 1991, Proc. Natl. Acad. Sci. USA, 88:4255-4259; Cotten et al.,
1990, Proc. Natl. Acad. Sci. USA, 87:4033-4037; Curiel et al., 1991, Proc.
Natl. Acad. Sci. USA, 88:8850-8854; Curiel et al., 1991, Hum. Gene Ther.,
3:147-154).

Many types of cells and cell lines (e.g., primary cell lines or
established cell lines) and tissues are capable of being stably transfected
by or receiving the constructs of the invention. Examples of cells that
may be used include, but are not limited to, stem cells, B lymphocytes, T
lymphocytes, macrophages, other white blood lymphocytes (e.g.,
myelocytes, macrophages, or monocytes), immune system cells of
different developmental stages, erythroid lineage cells, pancreatic cells,
lung cells, muscle cells, liver cells, fat cells, neuronal cells, glial cells,
other brain cells, transformed cells of various cell lineages corresponding
to normal cell counterparts (e.g., K562, HEL, HL60, and MEL cells), and
established or otherwise transformed cell lines derived from all of the
foregoing. In addition, the constructs of the present invention may be
transferred by various means directly into tissues, where they would
stably integrate into the cells comprising the tissues. Further, the
constructs containing the DNA sequences of the peptides of the invention
can be introduced into primary cells at various stages of development,
including the embryonic and fetal stages, so as to effect gene therapy at
early stages of development.

In one approach, plasmid DNA is complexed with a polylysine-conjugated
antibody specific to the adenovirus hexon protein, and the
resulting complex is bound to an adenovirus vector. The trimolecular
complex is then used to infect cells. The adenovirus vector permits
efficient binding, internalization, and degradation of the endosome before
the coupled DNA is damaged.

In another approach, liposome/DNA is used to mediate direct in vivo
gene transfer. While in standard liposome preparations the gene transfer
process is non-specific, localized in vivo uptake and expression have been
reported in tumor deposits, for example, following direct in situ
administration (Nabel, 1992, Hum. Gene Ther., 3:399-410).

Suitable gene transfer vectors possess a promoter sequence,
preferably a promoter that is cell-specific and placed upstream of the
sequence to be expressed. The vectors may also contain, optionally, one or
more expressible marker genes for expression as an indication of successful
transfection and expression of the nucleic acid sequences contained in the
vector. In addition, vectors can be optimized to minimize undesired
immunogenicity and maximize long-term expression of the desired gene
product(s) (see Nabel, 1999, Proc. Natl. Acad. Sci. USA, 96:324-326).
Moreover, vectors can be chosen based on the cell type that is targeted for
treatment.

Illustrative examples of vehicles or vector constructs for transfection
or infection of the host cells include replication-defective viral vectors, DNA
virus or RNA virus (retrovirus) vectors, such as adenovirus, herpes simplex
virus and adeno-associated viral vectors. Adeno-associated virus vectors
are single stranded and allow the efficient delivery of multiple copies of
nucleic acid to the cell's nucleus. Preferred are adenovirus vectors. The
vectors will normally be substantially free of any prokaryotic DNA and may
comprise a number of different functional nucleic acid sequences. An
example of such functional sequences may be a DNA region comprising
transcriptional and translational initiation and termination regulatory
sequences, including promoters (e.g., strong promoters, inducible
promoters, and the like) and enhancers which are active in the host cells.
Also included as part of the functional sequences is an open reading frame
(polynucleotide sequence) encoding a protein of interest. Flanking
sequences may also be included for site-directed integration. In some
situations, the 5'-flanking sequence will allow homologous recombination,
thus changing the nature of the transcriptional initiation region, so as to
provide for inducible or non-inducible transcription to increase or decrease
the level of transcription, as an example.

In general, the encoded and expressed peptide may be intracellular,
i.e., retained in the cytoplasm, nucleus, or in an organelle, or may be
secreted by the cell. For secretion, a signal sequence may be fused to the
peptide sequence. As previously mentioned, a marker may be present for
selection of cells containing the vector construct. The marker may be an
inducible or non-inducible gene and will generally allow for positive selection
under induction, or without induction, respectively. Examples of marker
genes include neomycin, dihydrofolate reductase, glutamine synthetase,
and the like. The vector employed will generally also include an origin of
replication and other genes that are necessary for replication in the host
cells, as routinely employed by those having skill in the art. As an example,
the replication system comprising the origin of replication and any proteins
associated with replication encoded by a particular virus may be included as
part of the construct. The replication system must be selected so that the
genes encoding products necessary for replication do not ultimately
transform the cells. Such replication systems are represented by
replication-defective adenovirus (see G. Acsadi et al., 1994, Hum. Mol.
Genet., 3:579-584) and by Epstein-Barr virus. An example of a
replication-defective vector, particularly a replication-defective retroviral
vector, is BAG (see Price et al., 1987, Proc. Natl. Acad. Sci. USA,
84:156; Sanes et al., 1986, EMBO J., 5:3133). It will be understood that the
final gene construct may contain one or more genes of interest, for example,
a gene encoding a bioactive metabolic molecule. In addition, cDNA,
synthetically produced DNA, or chromosomal DNA may be employed
utilizing methods and protocols known and practiced by those having skill in
the art.

According to one approach for gene therapy, a vector encoding an IR
or IGF-1R agonist or antagonist peptide is directly injected into the recipient
cells (in vivo gene therapy). Alternatively, cells from the intended recipients
are explanted, genetically modified to encode an IR or IGF-1R agonist or
antagonist peptide, and reimplanted into the donor (ex vivo gene therapy).
An ex vivo approach provides the advantage of efficient viral gene transfer,
which is superior to in vivo gene transfer approaches. In accordance with ex
vivo gene therapy, the host cells are first transfected with engineered
vectors containing at least one gene encoding an IR or IGF-1R agonist or
antagonist peptide, suspended in a physiologically acceptable carrier or
excipient such as saline or phosphate buffered saline, and the like, and then
administered to the host or host cells. The desired gene product is
expressed by the injected cells, which thus introduce the gene product into
the host. The introduced gene products can thereby be utilized to treat or
ameliorate a disorder that is related to altered insulin or IGF-1 levels (e.g.,
diabetes).

The described constructs may be administered in the form of a
pharmaceutical preparation or composition containing a pharmaceutically
acceptable carrier and a physiological excipient, in which preparation the
vector may be a viral vector construct, or the like, to target the cells, tissues,
or organs of the recipient organism of interest, including human and
non-human mammals. The composition may be formed by dispersing the
components in a suitable pharmaceutically acceptable liquid or solution such
as sterile physiological saline or other injectable aqueous liquids. The
amounts of the components to be used in such compositions may be
routinely determined by those having skill in the art. The compositions may
be administered by parenteral routes of injection, including subcutaneous,
intravenous, intramuscular, and intrasternal.

EXAMPLES 

The examples as set forth herein are meant to exemplify the various
aspects of the present invention and are not intended to limit the invention in
any way.

The following materials were used in the examples described below.
Soluble IGF-1R was obtained from R&D Systems (Minneapolis, MN; Cat. #
391-GR/CF). Insulin receptor was prepared according to Bass et al., 1996.
The insulin was either from Sigma (St. Louis, MO; Cat. # 1-0259) or
Boehringer. The IGF-1 was from PeproTech (Cat. # 100-11). All synthetic
peptides were synthesized by Novo Nordisk, AnaSpec, Inc. (San Jose, CA),
PeptidoGenics (Livermore, CA), or Research Genetics (Huntsville, AL) at
>80% purity. The Maxisorb Plates were from NUNC via Fisher (Cat. #
12565347). The HRP/Anti-M13 conjugate was from Pharmacia (Cat. #
27-9421-01). The ABTS solution was from BioF/X (Cat. # ABTS-0100-04).

Example 1: Monomer and Dimer Peptides
A. Cloning

Monomer and dimer peptides were constructed and expressed as
protein fusions to a chitin binding domain (CBD) using the pTYB2 vector
from the IMPACT™-CN system (New England Biolabs (NEB), Beverly, MA).
The pTYB2 vector encodes a protein-splicing element (termed intein), which
initiates self-cleavage upon the addition of DTT. The intein self-cleavage
separates the dimer from the affinity tag, to allow purification.

In the pTYB2 construct, the C-terminus of the peptide sequence was
fused to the N-terminus of the intein/CBD sequence. Two peptide-flanking
epitope tags were included: a shortened-FLAG® at the N-terminus and
E-Tag at the C-terminus. This fusion was generated by ligating a vector
fragment encoding the intein/CBD with a PCR product encoding the peptide
of interest.

The vector fragment was obtained by digesting the pTYB2 vector at
appropriate restriction sites. The digested DNA fragment was
resolved on a 1% agarose gel, excised, and purified by QIAEXII (QIAGEN,
Valencia, CA). To obtain the PCR product of the target proteins, primers
were synthesized which anneal to appropriate sequences. The vector and
insert were ligated overnight at 15°C. The ligation product was purified
using QIAquick spin columns (QIAGEN) and electroporations were
performed at 1500 V in an electroporation cuvette (0.1 mm gap; 0.5 ml
volume) containing 10 ng of DNA and 40 µl of E. coli strain BL21.

Immediately following electroporation, 1 ml of pre-warmed (40°C)
2xYT medium containing 2% glucose (2xYT-G) was added to the
transformants. The transformants were grown at 37°C for 1 h, and then
plated onto 2xYT-AG plates and incubated overnight at 37°C. Individual
colonies were isolated and used to inoculate 2xYT-G. The cultures were
grown overnight at 37°C. Plasmid DNA was isolated from the cultures and
sequencing was performed to confirm that the correct construct was
obtained.

B. Small-scale expression of peptide-CBD fusion proteins 

E. coli ER2566 (New England Biolabs) containing plasmids encoding
peptide-CBD fusion proteins were grown in 2xYT-AG at 37°C overnight, with
agitation (250 rpm). The following day, the cultures were used to inoculate
media (2xYT-G) to obtain an OD600 of 0.1. Upon reaching an OD600 of 0.6,
expression of the fusion protein was induced by the addition of IPTG
(isopropyl-β-D-thiogalactopyranoside) to a final concentration of 0.3 mM.
Cells were grown for 3 h. Following this, cells were pelleted by
centrifugation and the cell pellets were analyzed by SDS-PAGE.
Production of fusion proteins of the correct molecular weight
was confirmed by Western blot analysis using the monoclonal antibody
anti-E-Tag-HRP conjugate (Amersham Pharmacia).
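
The inoculation and induction steps above both reduce to the dilution relation C1 x V1 = C2 x V2. The following is a minimal sketch of that arithmetic only. The overnight culture density (OD600 of 4.0), the culture volume (50 ml), and the IPTG stock concentration (1 M) are hypothetical values that do not appear in this example, and the sketch assumes OD600 scales linearly with cell density and ignores the small volume added by the inoculum or the stock.

```python
def inoculum_volume_ml(overnight_od, target_od, final_volume_ml):
    """Volume of overnight culture needed to start a culture at target_od."""
    if overnight_od <= target_od:
        raise ValueError("overnight culture must be denser than the target")
    # C1 * V1 = C2 * V2, solved for V1.
    return target_od * final_volume_ml / overnight_od


def iptg_volume_ul(final_mm, culture_volume_ml, stock_mm):
    """Volume of IPTG stock (in microlitres) giving final_mm in the culture."""
    return final_mm * culture_volume_ml / stock_mm * 1000.0


if __name__ == "__main__":
    # Hypothetical inputs: overnight OD600 4.0, 50 ml culture, 1 M IPTG stock.
    print(inoculum_volume_ml(4.0, 0.1, 50.0), "ml of overnight culture")
    print(iptg_volume_ul(0.3, 50.0, 1000.0), "ul of IPTG stock")
```

The same two functions apply unchanged to the 2 L cultures of the large-scale protocol by passing a final volume of 2000 ml.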

C. Large-scale expression and purification of soluble 
peptide-CBD fusion proteins 

E. coli ER2566 carrying plasmids encoding the fusion proteins were
grown in 2xYT-AG media at 37°C for 8 h, with agitation (250 rpm). The
cultures were back-diluted into 2 L volumes of 2xYT-A to achieve an
OD600 of 0.1. Upon reaching an OD600 of 0.5, IPTG was added to a final
concentration of 0.3 mM. Cells were grown at 30°C overnight. The next day
cells were isolated by centrifugation. Samples of the cell pellet were
analyzed by SDS-PAGE followed by Western blot analysis using the
mouse monoclonal antibody anti-E-Tag-HRP conjugate (Pharmacia) to
visualize the expressed product.

D. Purification 

The cell pellets were disrupted mechanically by sonication or
chemically by treatment with a mild detergent. After removal of cell debris
by centrifugation, the soluble proteins in the clarified lysate were prepared
for chromatographic purification by dilution or dialysis into the appropriate
starting buffer. The CBD fusions were purified by chitin affinity
chromatography according to the manufacturer's instructions (New England
Biolabs). The lysate was loaded onto a chitin affinity column and the column
was washed with 10 volumes of column buffer. Three bed volumes of the
DTT-containing cleavage buffer were loaded onto the column and the
column was incubated overnight. The next day, the target protein was
eluted by continuing the flow of the cleavage buffer without DTT. The
purified proteins were analyzed for purity and integrity by SDS-PAGE and
Western blot analysis according to standard protocols.

Example 2: PEG-Based Dimer Peptides 

A. Synthesis of the aldehyde containing peptide 

The peptide was synthesized by stepwise solid phase synthesis on
Rink amide Tentagel (0.21 mmol/g). Three equivalents of Fmoc-amino
acids were used. The serine residue was introduced into the peptide by
either coupling Fmoc-Ser(tBu)-OH to the N-terminal peptide or coupling
Boc-Ser(tBu) to a selectively protected lysine side-chain. The peptide was
then deprotected and cleaved from the resin by treatment with 95% TFA
(trifluoroacetic acid; aq) containing TIS (triisopropylsilane). Periodate
oxidation, using 2 equivalents of NaIO4 in 20% DMSO (dimethyl sulfoxide)-
80% phosphate buffer pH 7.5 (45 µl/µmol peptide) for 5 min at room
temperature (RT), converted the 2-amino alcohol moiety into an α-oxoacyl
group. The peptide was purified immediately following oxidation.
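
The periodate oxidation conditions above (2 equivalents of NaIO4; 45 µl of 20% DMSO-80% phosphate buffer per µmol of peptide) scale linearly with the amount of peptide. The following sketch works through that scaling. The function name and the 1 µmol example are illustrative assumptions; the molar mass of NaIO4 (213.89 g/mol) is a standard value that is not stated in this example.

```python
NAIO4_MW = 213.89  # g/mol, sodium metaperiodate (standard value, not from the text)


def periodate_oxidation_setup(peptide_umol):
    """Reagent amounts for the oxidation, scaled to the amount of peptide."""
    total_ul = 45.0 * peptide_umol      # 45 ul solvent per umol peptide
    naio4_umol = 2.0 * peptide_umol     # 2 equivalents of periodate
    return {
        "naio4_umol": naio4_umol,
        "naio4_ug": naio4_umol * NAIO4_MW,  # umol * g/mol = micrograms
        "total_ul": total_ul,
        "dmso_ul": 0.20 * total_ul,         # 20% DMSO
        "buffer_ul": 0.80 * total_ul,       # 80% phosphate buffer, pH 7.5
    }


if __name__ == "__main__":
    # Hypothetical scale: 1 umol of peptide.
    for name, value in periodate_oxidation_setup(1.0).items():
        print(f"{name}: {value:.2f}")
```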

B. Synthesis of the PEG-based dimer 

The unprotected and oxidized peptide (4.2 equivalents) was dimerized
on the dioxyamino-PEG (polyethylene glycol)-linker (1 equivalent) in 90%
DMSO-10% 20 mM NaOAc buffer, pH 5.1 (4.2 µmol peptide). The
solution was left for 1 hr at 38°C and the progress of the reaction was
monitored by MALDI-MS (matrix-assisted laser desorption/ionization mass
spectrometry). Following this, the crude dimer was purified by
semi-preparative HPLC (high performance liquid chromatography).
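
When following the dimerization by MALDI-MS, it helps to know the mass to look for. Each oxime bond formed between an α-oxoacyl group and an aminooxy group releases one molecule of water, which is why Table 3 lists a "MW (-2H2O)" value for each linker. The following is a minimal sketch of the expected average mass of a dimer. The peptide mass of 2000.0 is hypothetical, the linker mass of 180.2 is the value listed for linker 17 in Table 3, and 18.015 g/mol is the standard average mass of water (Table 3 itself appears to subtract a rounded 36.0).

```python
WATER_MW = 18.015  # g/mol, average mass of H2O (standard value)


def expected_dimer_mass(peptide_mass, linker_mw, bonds_formed=2):
    """Average mass of two oxidized peptides joined by a bifunctional linker.

    One water is lost for each oxime bond formed.
    """
    return 2.0 * peptide_mass + linker_mw - bonds_formed * WATER_MW


if __name__ == "__main__":
    # Hypothetical oxidized peptide mass (2000.0) with linker 17 (MW 180.2).
    print(f"{expected_dimer_mass(2000.0, 180.2):.2f}")
```

A peak at the mass of one peptide plus the linker minus a single water would instead indicate a half-reacted, mono-substituted linker.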

The molecular weights and inter-peptide distances of various linkers are
shown in Table 3, below.



TABLE 3

(Structure drawings are not reproduced; structure text is shown only where it
is recoverable.)

Structure                      Number   MW            MW (-2H2O)
[drawing]                      1        100.1         64.1
[drawing]                      2        58.04         22.04
[drawing]                      3        149.15        113.15
[drawing]                      4        150.14        114.14
[drawing]                      5        134.13        98.13
[drawing]                      6        134.13        98.13
[drawing]                      7        134.13        98.13
[drawing]                      8        234.25        198.25
[drawing]                      9        [illegible]   [illegible]
[drawing]                      10       72.06         36.06
[drawing]                      11       86.09         50.09
[drawing]                      12       114.14        78.14
[drawing]                      13       128.08        92.08
[drawing]                      14       142.19        106.19
(HCO)4-(Lys)2-Lys-Gly-NH2      15       (not given)   (not given)
[drawing]                      16       136.2         100.2
[drawing]                      17       180.2         144.2
[drawing]                      18       224.3         188.3
[drawing]                      19       268.3         232.3
[drawing]                      20       312.4         276.4
[drawing]                      21       278.4         242.4
[drawing]                      22       240.3         204.3
[drawing]                      23       240.3         204.3
[drawing]                      24       210.2         192.2

Example 3: Determination of Insulin Receptor Binding

IR was incubated with 125I-labeled insulin at various concentrations of
test substance and the Kd was calculated. According to this method, human
insulin receptor (HIR) or human IGF-1 receptor (HIGF-1R) was purified from
transfected cells after solubilization with Triton X-100. The assay buffer
contained 100 mM HEPES (pH 7.8), 100 mM NaCl, 10 mM MgCl2, 0.5%
human serum albumin (HSA), 0.2% gammaglobulin, and 0.025% Triton
X-100. The receptor concentration was chosen to give 30-60% binding of
2000 cpm (3 pM) of its 125I-labeled ligand (TyrA14-125I-HI or
Tyr31-125I-IGF1), and a dilution series of the substance to be tested was
added. After equilibration for 2 days at 4°C, each sample (200 µl) was
precipitated by addition of 400 µl 25% PEG 6000, centrifuged, washed with
1 ml 15% PEG 6000, and counted in a gamma-counter.

The insulin/IGF-1 competition curve was fitted to a one-site binding
model and the calculated parameters for receptor concentration, insulin
affinity, and non-specific binding were used in calculating the binding
constants of the test substances. Representative curves for insulin
competition are shown in Figures 10A-10C; 11A-11D. Qualitative data are
provided in Table 4, below.
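
To make the idea of a one-site competition fit concrete, the following sketch fits a deliberately simplified model, in which the tracer and receptor concentrations are assumed to be far below Kd so that ligand depletion can be ignored. This is not the fitting procedure used in this example, which also estimated receptor concentration; the grid-search estimator, the function names, and the synthetic data (generated from a hypothetical Kd of 1e-8 M) are all illustrative assumptions.

```python
import math


def one_site_bound(conc_m, kd_m, b0, nsb):
    """Tracer bound (e.g., cpm) at competitor concentration conc_m.

    b0 is binding with no competitor; nsb is non-specific binding.
    """
    return nsb + (b0 - nsb) / (1.0 + conc_m / kd_m)


def fit_kd(concs_m, bound, b0, nsb, log_lo=-12.0, log_hi=-4.0, steps=8001):
    """Least-squares Kd estimate from a scan over a log-spaced grid."""
    best_kd, best_sse = None, math.inf
    for i in range(steps):
        kd = 10.0 ** (log_lo + (log_hi - log_lo) * i / (steps - 1))
        sse = sum((one_site_bound(c, kd, b0, nsb) - y) ** 2
                  for c, y in zip(concs_m, bound))
        if sse < best_sse:
            best_kd, best_sse = kd, sse
    return best_kd


if __name__ == "__main__":
    # Synthetic, noise-free data from a hypothetical Kd of 1e-8 M.
    concs = [10.0 ** e for e in range(-11, -5)]
    data = [one_site_bound(c, 1e-8, b0=1000.0, nsb=50.0) for c in concs]
    print(f"estimated Kd = {fit_kd(concs, data, 1000.0, 50.0):.2e} M")
```

With real data, b0 and nsb would normally be fitted together with Kd, and a nonlinear least-squares routine would replace the grid scan.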

Table 4 illustrates IR affinities for the RP9 monomer peptide and
various RP9 monomer truncations. The results demonstrate that the RP9
N-terminal sequence (GSLD; SEQ ID NO:1785) and C-terminal sequence
(LGKK; SEQ ID NO:1786) can be deleted without substantially affecting HIR
binding affinity (Table 4).

TABLE 4

Peptide   SEQ ID NO:   Formula   Site IR   Sequence            HIR Kd (mol/l)
S386      1559         -         -         GSLDESFYDWFERQLG    3.2 × 10^-7
S395      1787         -         -         GSLDESFYDWFERQL     9.1 × 10^-[illegible]
S394      1788         -         -         GSLDESFYDWFERQ      8.1 × 10^-8
S396      1789         -         -         GSLDESFYDWFER       >2 × 10^-[illegible]
S399      1790         -         -         ESFYDWFERQL         9.[illegible] × 10^-8
S400      1791         -         -         ESFYDWFERQ          6.3 × 10^-7

Figures 10A-10C demonstrate that Site 1-Site 2 heterodimer peptides
537, 538, and 539 bound to IR with substantially higher (several orders of
magnitude) affinity than corresponding monomer (D117 and 540) and
homodimer (521 and 535) peptides. Figures 11A-11D demonstrate that Site
1-Site 2 heterodimer peptides, 537 and 538, bound to IR with markedly
higher affinity than the monomer peptide D117.

Example 4: Adipocyte Assay for Determination of Insulin Agonist 
Activity 

Insulin increases uptake of 3H-glucose into adipocytes and its
conversion into lipid. Incorporation of 3H into the lipid phase was determined
by partitioning of lipid phase into a scintillant mixture, which excludes
water-soluble 3H products. The effect of compounds on the incorporation of
3H-glucose at a sub-maximal insulin dose was determined, and the results
expressed as increase relative to full insulin response. The method was
adapted from Moody et al., 1974, Horm Metab Res. 6(1):12-6.

Mouse epididymal fat pads were dissected out, minced into digestion
buffer (Krebs-Ringer 25 mM HEPES, 4% HSA, 1.1 mM glucose, 0.4 mg/ml
Collagenase Type 1, pH 7.4), and digested for up to 1.5 h at 36.5°C. After
filtration, washing (Krebs-Ringer HEPES, 1% HSA), and resuspension in
assay buffer (Krebs-Ringer HEPES, 1% HSA), free fat cells were pipetted
into 96-well Picoplates (Packard) containing test solution and approximately
an ED20 dose of insulin.

The assay was started by addition of 3H-glucose (Amersham TRK
239), in a final concentration of 0.45 mM glucose. The assay was incubated
for 2 h at 36.5°C in a Labshaker incubation tower at 400 rpm, then terminated
by the addition of Permablend/Toluene scintillant (or equivalent), and the
plates sealed, before standing for at least 1 h and detection in a Packard
Top Counter or equivalent. A full insulin standard curve (8 dose) was run as
control on each plate.

Data are presented graphically, as effect of compound on an
(approximate) ED20 insulin response, with data normalized to a full insulin
response. The assay can also be run at basal or maximal insulin
concentration. Representative dose-response curves for insulin and IGF-1
are shown in Figures 12-18. Qualitative data are shown in Tables 5-7.
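The normalization described above can be sketched as follows. This is an illustrative calculation only: it assumes that "full insulin response" means the difference between the maximal and basal signals of the insulin standard curve on the same plate, which the text does not spell out, and all names are hypothetical.

```python
def percent_of_full_response(signal, basal, maximal):
    """Express a raw 3H-lipid count as a percentage of the full insulin response."""
    if maximal == basal:
        raise ValueError("maximal and basal signals must differ")
    return 100.0 * (signal - basal) / (maximal - basal)


def effect_over_ed20(signal_with_compound, signal_ed20, basal, maximal):
    """Change caused by the compound, in percentage points of full response.

    Positive values suggest agonism on top of the ED20 insulin dose;
    negative values suggest antagonism.
    """
    with_compound = percent_of_full_response(signal_with_compound, basal, maximal)
    ed20_alone = percent_of_full_response(signal_ed20, basal, maximal)
    return with_compound - ed20_alone
```

For example, with a basal count of 1000, a maximal count of 11000, and an ED20 count of 3000, a well reading 8000 in the presence of compound would correspond to an increase of 50 percentage points.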

In free fat cell (FFC) assays, truncated synthetic RP9 monomer
peptides S390 and S394 showed potency similar to full-length RP9
monomer peptides (Figures 12A-12D). Truncated synthetic RP9 homodimer
peptides S415 and S417 were highly potent in FFC assays, but less potent
than full-length RP9 homodimer peptides (Figures 13A-13C; compare to
peptides 521 and 535, described below). The potency of recombinant RP9
homodimer peptides 521 and 535 in FFC assays is shown in Figures 14A-14C.
The curves are flattened, suggesting that the binding mechanism may
not be mediated by simple intramolecular binding (Figures 14A-14C).

Results further indicated that synthetic RP9 homodimer peptides
S337 and S374 showed increased HIR binding affinity and increased potency
in FFC assays compared to synthetic RP9 monomer, S371 (Table 5).
Similarly, synthetic RP9 homodimer peptides S314 and S317 showed
increased HIR binding affinity and increased potency in FFC assays
compared to synthetic RP9 monomer, S371, and various RP9 truncations
(Table 6).



TABLE 5

Pep.  SEQ ID NO:  Formula  Site IR  Monomer or Dimer  Sequence                      HIR Kd (mol/l)      FFC
S371  1558        1        1        M (RP9)           GSLDESFYDWFERQLGKK            6.3*10^-7           +
S337  1792        1-1      1-1      D, C-Term 23      (GSLDESFYDWFERQLGKK-Lig)2-23  1.1*10^[illegible]  +++++
S374  1793        1-1      1-1      D, N-Term 17      17-(GSLDESFYDWFERQLGKK)2      1.8*10^-7           ++++

M = monomer; D = dimer; C-Term = C-terminal linker (C-C); N-Term = N-terminal linker
(N-N); 23 and 17 represent specific chemical linkers (see Table 3); For FFC: 0 is no effect,
+ is agonist, - is antagonist.

TABLE 6

Peptide     SEQ ID NO:  Form.  Site IR  Mon. or Dimer  Sequence             HIR Kd (mol/l)      FFC
S371 (RP9)  1558        1      1        M              GSLDESFYDWFERQLGKK   6.3*10^-7           +
S395        1787        1               M              GSLDESFYDWFERQL      9.1*10^-8           +
S394        1788        1               M              GSLDESFYDWFERQ       8.1*10^-8           ++
S396        1789        1               M              GSLDESFYDWFER        >2*10^-5            0
S390        1794        1               M              ESFYDWFERQLG         6.2*10^[illegible]  +
S399        1790        1               M              ESFYDWFERQL          9.1*10^-8           ++
S400        1791        1               M              ESFYDWFERQ           6.3*10^-7           0
S415        1795        1-1    1-1      D; C-Term      (ESFYDWFERQLGK)2-23  1.0*10^-7           ++++
S417        1796        1-1    1-1      D; N-Term      23-(ESFYDWFERQLG)2   9.2*10^-7           +++

M = monomer; D = dimer; C-Term = C-terminal linker (C-C); N-Term = N-terminal linker
(N-N); 23 represents a specific chemical linker (see Table 3); For FFC: 0 is no effect, + is
agonist, - is antagonist; Form. = formula; Mon. = monomer.

Site 1-Site 2 dimer peptides 537 and 538 were inactive in the FFC
assays using the standard concentration of insulin (Figures 15A-15C).
However, Site 1-Site 2 dimer peptides 537 and 538 were antagonists in the
FFC assay in the presence of a stimulating concentration of insulin (Figures
16A-16C). In contrast, Site 2-Site 1 dimer peptide 539 was a full agonist in
the FFC assay, with a slope similar to that of insulin (Figures 17A-17B).

Additional experiments confirmed that FFC assay activity of Site 1-Site 2
dimer peptides was affected by the orientation of the monomer
subunits (Figures 18A-18D). In particular, dimer peptides comprising Site 1
(S372 or S373) and Site 2 (S451 or S452) monomer subunits exhibited
antagonist activity in the Site 1-Site 2 orientation (C-N linkage) (dimer
peptide S453); moderate levels of agonist activity in the Site 1-Site 2
orientation (N-N or C-C linkage) (dimer peptides S454 and S456); and high
levels of agonist activity in the Site 2-Site 1 orientation (C-N linkage) (dimer
peptide S455) (Figures 18A-18D).

Table 7, below, shows the HIR binding affinity and FFC assay
potency of various synthetic peptides, including Site 1-Site 1 dimer peptides
S325, S329, S332, S333, S334, S335, S336, S337, S349, S350, S351,
S352, S353, S354, S361, S362, S363, S374, S375, S376, S378, S379,
S380, S381, S414, S415, S416, S417, S418, S420, and S424. These
synthetic dimer peptides exhibited properties comparable to dimer peptides
521 and 535, regardless of the orientation of the monomer subunits. In
particular, synthetic Site 1-Site 2 dimer peptides S425, S453, and S459
exhibited antagonist properties comparable to those of the Site 1-Site 2
dimer peptides 537 and 538. Synthetic Site 1-Site 2 dimer peptides S455,
S457, and S458 exhibited agonist properties comparable to the dimer
peptide 539. Synthetic Site 1-Site 2 dimer peptides S436, S437, S438,
S454, and S456 act as partial agonists in the FFC assay (i.e., the peptides
exhibit a maximal response of less than 100% that of insulin), which is
shown in the table as "++" and "+++".

Table 7 also shows properties of truncated monomer and dimer
peptides, and thereby indicates which N- or C-terminal residues can be
deleted without substantial loss of HIR binding affinity (e.g., see synthetic
peptides S386 through S392, S394 through S403, and S436 through S445).
Notably, certain Site 2-Site 1 dimers show IR affinities of 2*10^-11 (see, e.g.,
S519 and S520). These peptides are also very potent in the fat cell assay
(Figures 31A-31B) and even more potent in the HIR kinase assay (Figures
32A-32B) (kinase assay described below).

FHENFYDWFVAQVSKK
FHENFYDWFARQVSKK
FHEAFYDWFVRQVSKK
FHANFYDWFVRQVSKK
FAENFYDWFVRQVSKK
AHENFYDWFVRQVSKK
fhenfydwfvrqvskk
FHENFYGWFVRQVSKK
SDGFYNAIELLS

Results further indicated that S175-S175 dimer peptides (Site 1-Site 1)
were less agonistic than S175 monomer peptides (++ vs. +++).
S175-S175 dimer peptides having a C-N linkage were less agonistic than, or
equally agonistic as, S175-S175 dimer peptides having C-C or N-N
linkages. F8-F8 dimer peptides, like the parent monomer, showed no
agonist activity.

Table 7 further indicates that, relative to peptide S519, a potent
insulin mimetic, the alterations that are most influential in increasing receptor
affinity and potency are: acetylation of the N-terminal amino group;
replacing V at position 9 with I; replacing E at position 10 with Q; replacing
Y at position 14 with W; and deleting the sequence GSLD at positions 21 to
24.

Example 5: Substrate Phosphorylation Assay (HIR Kinase)

WGA (wheat germ agglutinin)-purified recombinant human insulin
receptor was mixed with either insulin or peptide in varying concentrations in
substrate phosphorylation buffer (50 mM HEPES (pH 8.0), 3 mM MnCl2, 10
mM MgCl2, 0.05% Triton X-100, 0.1% BSA, 12.5 ATP). A synthetic
biotinylated substrate peptide (Biotin-KSRGDYMTMQIG) was added to a
final concentration of 2 ng/ml. Following a 1 hr incubation at RT, the
reactions were stopped by the addition of 50 mM EDTA. The reactions were
transferred to Streptavidin-coated 96-well microtiter plates (NUNC, Cat. No.
236001) and incubated for 1 hr at RT. The plates were washed 3 times with
TBS (10 mM Tris (pH 8.0), 150 mM NaCl).

Subsequently, a 2000-fold dilution of horseradish peroxidase (HRPO)
conjugated phosphotyrosine antibody (Transduction Laboratories, Cat. No.
E120H) in TBS was added. The plates were incubated for 30 min and
washed 3 times with TBS. TMB (3,3',5,5'-tetramethylbenzidine) substrate
from Kem-En-Tec (Copenhagen, Denmark) was added. After 10-15 min,
the reaction was stopped by the addition of 1% acetic acid. The absorbance,
representing the extent of substrate phosphorylation, was measured in a
spectrophotometer at a wavelength of 450 nm.

The results indicated that the potency of the Site 2-Site 1 dimer,
peptide 539, was 0.1 to 1% of that of insulin in all assays tested (Table 8),
and the dose-response curves (Figures 17A-17B) had a shape similar to
that of insulin dose-response curves, suggesting an insulin-like action
mechanism. In addition, Site 1-Site 2 dimer peptides 537 and 538 were also
active as specific insulin receptor antagonists (Table 8; Figures 16A-16C).
Notably, Site 2-Site 1 dimer peptide 539 was more active in the kinase
assay than Site 1-Site 1 homodimer peptides 521 and 535 (Figures 19A-19B),
despite lower FFC potency (Figures 14A-14C; Figures 17A-17B).
Similar results are shown in Figures 20A-B and Figures 21A-B. These data
suggested that homodimer and heterodimer peptides used different
mechanisms of action.



TABLE 8

Pep.     Mon./Link.    SEQ ID NO:  Form.  Site IR  HIR Kd (nM)  HIGF-1R Kd (nM)  FFC Pot. (nM)  Kinase Pot. (nM)  Sequence
HI                                 na     na
HIGF-1R                            na     na
521      RP9-6aa-RP9   2112        1-1    1-1      25                            A 3            1400              MADYKDDDDKGSLDESFYDWFERQLGKKGGSGGSGSLDESFYDWFERQLGKKAAA(ETAG)PG
535      RP9-12aa-RP9  2113        1-1    1-1      15                            A 2            1000              MADYKDDDDKGSLDESFYDWFERQLGKKGGSGGSGGSGGSGSLDESFYDWFERQLGKKAAA(ETAG)PG
537      RP9-6aa-D8    2114        1-6    1-2      0.092        980              N 10           Inactive          MADYKDDDDKGSLDESFYDWFERQLGKKGGSGGSWLDQEWAWVQCEVYGRGCPSAAA(ETAG)PG
538      RP9-12aa-D8   2115        1-6    1-2      0.080        710              N 10           Inactive          MADYKDDDDKGSLDESFYDWFERQLGKKGGSGGSGGSGGSWLDQEWAWVQCEVYGRGCPSAAA(ETAG)PG
539      D8-6aa-RP9    2116        6-1    2-1      0.530        1500             A 10           110               MADYKDDDDKWLDQEWAWVQCEVYGRGCPSGGSGGSGSLDESFYDWFERQLGKKAAA(ETAG)PG

A = agonist; N = antagonist; na = not applicable; Form. = formula; Mon. = constituent
monomers; Link. = linker; Pot. = potency; HI and HIGF-1R are controls; All with tags at both
ends; All dimers are linked C-N; Linker sequences are underlined.

Example 6: IR Autophosphorylation Assays 

IR activation was assayed by detecting autophosphorylation of an
insulin receptor construct transfected into 32D cells (Wang et al., 1993,
Science 261:1591-1594; clone 969). The IR-transfected 32D cells were
seeded at 5 x 10^6 cells/well in 96-well tissue culture plates and incubated
overnight at 37°C. Samples were diluted 1:10 in the stimulation medium
(RPMI 1640 with 25 mM HEPES pH 7.2) plus or minus insulin. The culture
medium was decanted from the cell culture plates, and the diluted samples
were added to the cells. The plates were incubated at 37°C for 30 min. The
stimulation medium was decanted from the plates, and cell lysis buffer (50
mM HEPES pH 7.2, 150 mM NaCl, 0.5% Triton X-100, 1 mM AEBSF, 10
KIU/ml aprotinin, 50 µM leupeptin, and 2 mM sodium orthovanadate) was
added. The cells were lysed for 30 min.

In the ELISA portion of the assay, the cell lysates were added to the
BSA-blocked anti-IR unit mAb (Upstate Biotechnology, Lake Placid, NY)
coated ELISA plates. After a 2 h incubation, the plates were washed 6
times with PBST and biotinylated anti-phosphotyrosine antibody (Upstate
Biotechnology) was added. After another 2 h incubation, the plates were
again washed 6 times. Streptavidin-Eu was then added, and the plates
were incubated for 1 h. After washing the plates again, EG&G Wallac
enhancement solution (100 mM acetone-potassium hydrogen phthalate, pH
3.2; 15 mM 2-naphthyltrifluoroacetate; 50 mM tri(n-octyl)-phosphine oxide;
0.1% Triton X-100) was added into each well, and the plates were placed
onto a shaker for 20 min at RT. Fluorescence of samples in each well was
measured at 615 nm using a VICTOR 1420 Multilabel Counter (EG&G
Wallac).

Alternatively, IR autophosphorylation was determined using a
holoenzyme phosphorylation assay. In accordance with this assay, 1 µl of
purified insulin receptor (isolated from a Wheat Germ Agglutinin Expression
System) was incubated with 25 nM insulin, or 10 or 50 µM peptide in 50 µl
autophosphorylation buffer (50 mM HEPES pH 8.0, 150 mM NaCl, 0.025%
Triton X-100, 5 mM MnCl2, 50 µM sodium orthovanadate) containing 10 µM
ATP for 45 min at 22°C. The reaction was stopped by adding 50 µl of gel
loading buffer containing β-mercaptoethanol (Bio-Rad Laboratories, Inc.,
Hercules, CA). The samples were run on 4-12% SDS-polyacrylamide gels.
Western blot analysis was performed by transferring the proteins onto
nitrocellulose membrane. The membrane was blocked in PBS containing
3% milk overnight. The membrane was incubated with anti-phosphotyrosine
4G10 HRP-labeled antibody (Upstate Biotechnology) for 2 h. Protein bands
were visualized using SuperSignal West Dura Extended Duration Substrate
Chemiluminescence Detection System (Pierce Chemical Co.).

Example 7: Fluorescence-Based HIR Binding Assays 

A. Time-Resolved Fluorescence Resonance Energy Transfer Assays

Time-resolved fluorescence resonance energy transfer assays (TR-FRET)
were used for peptide competition studies. In one set of assays,
monomer and dimer peptides were tested for the ability to compete with
biotinylated RP-9 monomer peptide (b-RP9) for binding to
HIR-immunoglobulin heavy chain chimera (sIR-Fc; Bass et al., 1996). The
assays were performed using a 384-well white microplate (NUNC) with a
final volume of 30 µl. Final incubation conditions were 22 nM b-RP9, 1
nM SA-APC (streptavidin-allophycocyanin), 1 nM Eu3+-sIR-Fc (LANCE™
labeled, PE Wallac, Inc.), 0.05 M Tris-HCl (pH 8 at 25°C), 0.138 M NaCl,
0.0027 M KCl, and 0.1% BSA (Cohn Fraction V). After 16-24 hr of
incubation at RT, the fluorescence signal at 665 nm and 620 nm was read
on a Victor2 1420 plate reader (PE Wallac, Inc.). Primary data were
background corrected, normalized to buffer controls, and then expressed as
percent of specific binding.

Results are shown in Figures 22A-22B. Figure 21A shows b-RP9
competition data. For these figures, the Z'-factor was greater than 0.5 (Z' =
1-(3σ+ + 3σ-)/|µ+ - µ-|; Zhang et al., 1999, J. Biomol. Screen. 4:67-73), and the
signal-to-background (S/B) ratio was ~4-5. In Figure 22A, each data point
represents the average of two replicate wells. The lines represent the best
fit to a four-parameter non-linear regression analysis of the data according
to the following formula: y = min + (max-min)/(1+10^((logIC50-x)*Hillslope)).
This was used to determine IC50 values.
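The two formulas quoted above can be written out as a short sketch. The curve function follows the four-parameter formula as given in the text, and the Z'-factor follows the cited definition; the function names, and the use of sample standard deviations for the control wells, are assumptions of this illustration rather than details from the original analysis.

```python
import statistics


def four_parameter_curve(x, minimum, maximum, log_ic50, hill_slope):
    """y = min + (max-min)/(1+10^((logIC50-x)*Hillslope)); x is log10 concentration."""
    return minimum + (maximum - minimum) / (
        1.0 + 10.0 ** ((log_ic50 - x) * hill_slope)
    )


def z_prime(positive_controls, negative_controls):
    """Z' = 1 - (3*sd_pos + 3*sd_neg) / |mean_pos - mean_neg|."""
    mean_pos = statistics.mean(positive_controls)
    mean_neg = statistics.mean(negative_controls)
    sd_pos = statistics.stdev(positive_controls)  # sample SD (assumption)
    sd_neg = statistics.stdev(negative_controls)
    return 1.0 - (3.0 * sd_pos + 3.0 * sd_neg) / abs(mean_pos - mean_neg)
```

With this parameterization, a competition curve that falls as competitor is added corresponds to a negative Hill slope, and the curve passes through the midpoint between min and max at x = logIC50.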

In another set of assays, monomer and dimer peptides were tested
for the ability to compete with biotinylated-S175 (b-S175) or b-RP9 for
binding to sIR-Fc. The TR-FRET assays were performed in a 384-well white
microplate with a final volume of 30 µl. Final incubation conditions were
33 nM b-S175 or 22 nM b-RP9, 1 nM SA-APC, 1 nM Eu3+-sIR-Fc, 0.05 M
Tris-HCl (pH 8 at 25°C), 0.138 M NaCl, 0.0027 M KCl, and 0.1% BSA.
After 16-24 hr of incubation at RT, the fluorescence signal at 665 nm and
620 nm was read on a Victor2 1420 plate reader. Primary data were
background corrected, normalized to buffer controls, and then expressed as
percent of specific binding.

Results are shown in Figures 23A-23B. For these figures, each data
point represents the average of two replicate wells. The lines represent the
best fit to a four-parameter non-linear regression analysis of the data, which
was used to determine IC50 values. Figure 23A shows b-S175 competition
data; Figure 23B shows b-RP9 competition data.

B. Fluorescence Polarization Assays 

Fluorescence polarization assays (FP) were used for peptide
competition studies. In one set of assays, monomer and dimer peptides
were tested for the ability to compete with fluorescein-RP-9 (FITC-RP9) for
binding to soluble HIR ectodomain (sIR; Kristensen et al., 1998, J. Biol.
Chem. 273:17780-17786). The assays were performed in a 384-well black
microplate (NUNC) with a final volume of 30 µl. Final incubation conditions
were 1 nM FITC-RP9, 10 nM sIR, 0.05 M Tris-HCl (pH 8 at 25°C), 0.138 M
NaCl, 0.0027 M KCl, 0.05% BGG (bovine gamma globulin), 0.005%
Tween-20®. After 16-24 hr of incubation at RT, the fluorescence signal at
520 nm was read on an Analyst™ AD plate reader (LJL BioSystems, Inc.).
Primary data were background corrected using 10 nM sIR without FITC-RP9
addition, normalized to buffer controls, and then expressed as percent of
specific binding. The Z'-factor was greater than 0.5 and the assay dynamic
range was ~125 mP. In Figures 24-27, each data point represents the
average of two replicate wells. The lines represent the best fit to a
four-parameter non-linear regression analysis of the data, which was used to
determine IC50 values. Results are shown in Figures 24A-24B.

In another set of assays, monomer and dimer peptides were tested
for the ability to compete with FITC-RP9 for binding to soluble human insulin
mini-receptor (mIR; Kristensen et al., 1999, J. Biol. Chem. 274:37351-37356).
The FP assays were performed in a 384-well black microplate with
a final volume of 30 µl. Final incubation conditions were 2 nM FITC-RP9, 20
nM mIR, 0.05 M Tris-HCl (pH 8 at 25°C), 0.138 M NaCl, 0.0027 M KCl,
0.001% BGG, 0.005% Tween-20®. After 16-24 hr of incubation at RT, the
fluorescence signal at 520 nm was read on an Analyst™ AD plate reader.
Primary data were background corrected using 20 nM mIR without FITC-RP9
addition, normalized to buffer controls, and then expressed as percent
of specific binding. Results are shown in Figures 25A-25B.

Monomer and dimer peptides were also tested for the ability to
compete with fluorescein-insulin (FITC-Insulin) for binding to sIR. The FP
assays were performed in a 384-well black microplate with a final volume of
30 µl. Final incubation conditions were 2 nM FITC-Insulin, 20 nM sIR,
0.05 M Tris-HCl (pH 8 at 25°C), 0.138 M NaCl, 0.0027 M KCl, 0.05% BGG,
0.005% Tween-20®. After 16-24 hr of incubation at RT, the fluorescence
signal at 520 nm was read on an Analyst™ AD plate reader. Primary data
were background corrected using 20 nM sIR without FITC-Insulin addition,
normalized to buffer controls, and then expressed as percent of specific
binding. Results are shown in Figures 26A-26B.

In other assays, monomer and dimer peptides were tested
for the ability to compete with FITC-Insulin for binding to mIR. The FP
assays were performed in a 384-well black microplate with a final volume of
30 µl. Final incubation conditions were 2 nM FITC-Insulin, 20 nM mIR, 0.05
M Tris-HCl (pH 8 at 25°C), 0.138 M NaCl, 0.0027 M KCl, 0.05% BGG
(bovine gamma globulin), 0.005% Tween-20®. After 16-24 hr of incubation
at RT, the fluorescence signal at 520 nm was read on an Analyst™ AD plate
reader. Primary data were background corrected using 20 nM mIR without
FITC-Insulin addition, normalized to buffer controls, and then expressed as
percent of specific binding. Results are shown in Figures 27A-27B.

C. Summary 

Table 9, below, summarizes the binding data calculated from
competition assays using the IR constructs, sIR-Fc, sIR, and mIR, in TR-FRET
and FP formats. The data in Table 9 indicate that most dimer
peptides (e.g., S291 and S375 or S337) showed greater agonist activity
than the corresponding monomer peptides (e.g., H2C or RP9, respectively)
in the FFC assay. It was previously demonstrated that an inequality
between monomer peptides and insulin was exhibited in competition assays
where the assay reporter was a monomer peptide (i.e., RP9 or S175). This
inequality was also demonstrated by dimer peptides as seen in Table 9.
Table 9 further shows that Group 6 monomer peptides such as E8 (D120)
were able to compete with FITC-RP9 or b-RP9 peptides for binding to sIR-Fc,
but did not compete with peptide ligands, such as FITC-RP9, for binding to
mIR. Experiments using different IR constructs thereby allowed
differentiation of Site 1 peptides based on sequence motifs (i.e., Group 6
(Formula 10) vs. Group 1 (Formula 1; A6)).

Based on the functional studies outlined above, the following peptide
dimers were designed.

SEQ ID NO:  Monomers/Linkers  Sequence
2119        F8-6aa-RP9        HLCVLEELFWGASLFGYCSGGGSGGSGSLDESFYDWFERQL
2120        F8-12aa-RP9       HLCVLEELFWGASLFGYCSGGGSGGSGGSGGSGSLDESFYDWFERQL
2121        D8-6aa-S175       WLDQEWAWVQCEVYGRGCPSGGSGGSGRVDWLQRNANFYDWFVAELG
2122        D8-12aa-S175      WLDQEWAWVQCEVYGRGCPSGGSGGSGGSGGSGRVDWLQRNANFYDWFVAELG
2123        F8-6aa-S175       HLCVLEELFWGASLFGYCSGGGSGGSGRVDWLQRNANFYDWFVAELG
2124        F8-12aa-S175      HLCVLEELFWGASLFGYCSGGGSGGSGGSGGSGRVDWLQRNANFYDWFVAELG
2125        D8-6aa-RP15       HLCVLEELFWGASLFGYCSGGGSGGSSQAGSAFYAWFDQVLRTV
2126        D8-6aa-RP6        HLCVLEELFWGASLFGYCSGGGSGGSTFYSCLASLLTGTPQPNRGPWERCR
2127        D8-6aa-RP17       HLCVLEELFWGASLFGYCSGGGSGGSQSDAFYSGLWALIGLSDG
2128        D8-6aa-Grp6       HLCVLEELFWGASLFGYCSGGGSGGSDSDWAGYEWFEEQLD

Linker sequences are underlined and in bold; Monomer sequences are shown below; All
dimers are linked C-N.



SEQ ID NO:  Monomer  Formula  Site  Sequence
1576        F8       4        2     HLCVLEELFWGASLFGYCSG
1558        RP9      1        1     GSLDESFYDWFERQL
2129        D8       6        2     WLDQEWAWVQCEVYGRGCPS
1560        S175     1        1     GRVDWLQRNANFYDWFVAELG
2130        RP15     1        1     SQAGSAFYAWFDQVIRTV
1635        RP6      2        1     TFYSCLASLLTGTPQPNRGPWERCR
2131        RP17     1        1     QSDAFYSGLWALIGLSDG
1595        Group 6  10       1     DSDWAGYEWFEEQLD
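The C-N linked dimer sequences listed above can be reproduced by concatenating the first monomer, a linker, and the second monomer. The sketch below is illustrative only: the monomer sequences are those listed above, while the assumption that the 6- and 12-residue linkers are repeats of GGS is inferred from the listed dimer sequences, and the function names are hypothetical.

```python
# Monomer sequences as listed in the monomer table above.
MONOMERS = {
    "F8": "HLCVLEELFWGASLFGYCSG",
    "RP9": "GSLDESFYDWFERQL",
    "D8": "WLDQEWAWVQCEVYGRGCPS",
    "S175": "GRVDWLQRNANFYDWFVAELG",
}


def make_linker(length):
    """Build a linker of the given length from GGS repeats (inferred pattern)."""
    if length <= 0 or length % 3 != 0:
        raise ValueError("linker length must be a positive multiple of 3")
    return "GGS" * (length // 3)


def make_dimer(first, linker_length, second):
    """Join two monomers C-terminus to N-terminus through a GGS-repeat linker."""
    return MONOMERS[first] + make_linker(linker_length) + MONOMERS[second]
```

For example, `make_dimer("F8", 6, "RP9")` yields the sequence listed for SEQ ID NO:2119, and `make_dimer("D8", 12, "S175")` yields the sequence listed for SEQ ID NO:2122.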



Example 8: Peptide Fusions To The Maltose Binding Protein 

A. Cloning

The transfer of interesting peptide sequences from phage display to
maltose binding protein (MBP) fusions is desirable for several reasons.
First, to obtain a more sensitive affinity estimate, the polyvalency of phage
display peptides should be converted to a monovalent system. For this
purpose, the peptide sequences are fused to MBP, which generally exists as a
monomer with no cysteine residues. Second, competition experiments can
be carried out with the same or different peptides, one phage displayed and
the other fused to MBP. Lastly, purified peptides can be obtained by
cleavage of the fusion protein at a site engineered in the DNA sequence.

Figure 28 shows a schematic drawing of the MBP-peptide construct.
In the construct, the N-terminus of the peptide sequence is fused to the
C-terminus of the MBP. Two peptide-flanking epitope tags are included, a
shortened-FLAG® at the N-terminus and E-Tag at the C-terminus. The
corresponding gene fusion was generated by ligating a vector fragment
encoding the MBP in frame with a PCR product encoding the peptide of
interest. The vector fragment was obtained by digesting the plasmid pMAL-c2
(New England Biolabs) with EcoRI and HindIII and then treating the
fragment with shrimp alkaline phosphatase (SAP; Amersham). The
digested DNA fragment was resolved on a 1% agarose gel, excised, and
purified by QIAEXII (QIAGEN). The 20-amino acid peptide sequences of
interest were initially encoded in the phage display vector pCANTAB5E
(Pharmacia). To obtain these sequences, primers were synthesized which
anneal to sequences encoding the shortened FLAG® or E-Tag epitopes and
also contain the required restriction enzyme sites EcoRI and HindIII. PCR
products were obtained from individual phage clones and digested with
restriction enzymes to yield the insert fragment. The vector and insert were
ligated overnight at 15°C. The ligation product was purified using QIAquick
spin columns (QIAGEN) and electroporations were performed at 1500 V in
an electroporation cuvette (0.1 mm gap; 0.5 ml volume) containing 10 ng of
DNA and 40 µl of E. coli strain ER2508 (RR1 lon::miniTn10(Tetr) Δ(malB)
Δ(argF-lac)U169 Pro+ zjc::Tn5(Kanr) fhuA2) electrocompetent cells (New
England Biolabs). Immediately after the pulse, 1 ml of pre-warmed (40°C)
2xYT medium containing 2% glucose (2xYT-G) was added and the
transformants were grown at 37°C for 1 h. Cell transformants were plated
onto 2xYT-AG plates and grown overnight at 37°C. Sequencing confirmed
that the clones contained the correct constructs.

B. Small-Scale Expression of Soluble MBP-Peptide Fusion 
Proteins 

E. coli ER2508 (New England Biolabs) carrying the plasmids
encoding MBP-peptide fusion proteins were grown in 2xYT-AG at 37°C
overnight (250 rpm). The following day the cultures were used to inoculate
media (2xYT-G) to achieve an OD600 of 0.1. When the cultures
reached an OD600 of 0.6, expression was induced by the addition of IPTG to
a final concentration of 0.3 mM and then cells were grown for 3 h. The cells
were pelleted by centrifugation and samples from total cells were analyzed
by SDS-PAGE. The production of the correct molecular
weight fusion proteins was confirmed by Western blot analysis using the
monoclonal antibody anti-E-Tag-HRP conjugate (Pharmacia).

C. Large-Scale Expression of Soluble MBP-Peptide Fusion Proteins

E. coli ER2508 carrying plasmids encoding the MBP-peptide fusion
proteins were grown in 2xYT-AG media for 8 h (250 rpm, 37°C). The
cultures were subcultured in 2xYT-AG to achieve an OD600 of 0.1 and grown
at 30°C overnight. This culture was used to inoculate a fermentor with
medium of the following composition (g/l): glucose (3.00); (NH4)2SO4 (5.00);
MgSO4·7H2O (0.25); KH2PO4 (3.00); citric acid (3.00); peptone (10.00);
and yeast extract (5.00); pH 6.8.

The culture was grown at 700 rpm, 37°C until the glucose from the
medium was consumed (OD600 = ~6.0-7.0). Expression of the fusion
protein was induced by the addition of 0.3 mM IPTG and the culture was
grown for 2 h in fed-batch mode fermentation with feeding by 50% glucose
at a constant rate of 2 g/l/h. The cells were removed from the medium by
centrifugation. Samples of the cell pellet were analyzed by SDS-PAGE
followed by Western blot analysis using the mouse monoclonal antibody
anti-E-Tag-HRP conjugate (Pharmacia) to visualize the expressed product.

D. Purification 

The cell pellets were disrupted mechanically by sonication or
chemically by treatment with the mild detergent Triton X-100. After removal
of cell debris by centrifugation, the soluble proteins were prepared for
chromatographic purification by dilution or dialysis into the appropriate
starting buffer. The MBP fusions were initially purified either by amylose
affinity chromatography or by anion exchange chromatography. Final
purification was performed using anti-E-Tag antibody affinity columns
(Pharmacia). The affinity resin was equilibrated in TBS (0.025 M
Tris-buffered saline, pH 7.4) and the bound protein was eluted with Elution buffer
(100 mM glycine, pH 3.0). The purified proteins were analyzed for purity
and integrity by SDS-PAGE and Western blot analysis according to standard
protocols.

For MBP fusions, IR agonist activity was observed for the Site 1-Site 1
dimer peptides shown in Table 10, below. Additional binding data for the
MBP fusions are shown in Table 11, also below.

E. BIAcore Analysis

For BIAcore analysis of fusion protein and synthetic peptide binding to
insulin receptor, insulin (50 ng/ml in 10 mM sodium acetate buffer pH 5) was
immobilized on the CM5 sensor chip (Flowcell-2) by amine coupling.
Flowcell-1 was used for background binding to correct for any non-specific
binding. Insulin receptor (450 nM) was injected into the flow cell and the
binding of IR to insulin was measured in resonance units (RUs). Receptor
bound to insulin gave a reading of 220 RU. The surface was regenerated
with 25 mM NaOH. Pre-incubation of receptor with insulin in a tube at RT
reduced the response to 16 RU. Thus, the system was
validated for competition studies. Several maltose-binding fusion proteins,
peptides, and rVabs were pre-incubated with insulin receptor before injecting
over the insulin chip for competition studies. The decrease in
binding (resonance units) indicates that several MBP-fusion proteins can block
the insulin-binding site. The results are shown in Tables 12 and 13. The
amino acid sequences referred to in the tables are identified in Figures 8 and
9A-9B, except the 447 and 2A9 sequences, which are shown below.



TABLE 12 

BIAcore Results — Fusion Proteins Compete for Binding to IR 





Incubation Mixtures 


Result (RUs) 


Sequence Type 


Controls 


Insulin Receptor (IR) 450 nM 


220 


Positive Control 




Insulin ( 8.7 pM) 


16 


Negative Control 


MBP Fus. Prots. 


A7(20A4)-MBP (4.1 pM) + IR 


43 


Formula 6 Motif 




D8-MBP (1.6mM) + IR 


56 


Formula 6 Motif 




D10-MBP (3.4mM) + IR 


81 


Formula 1 1 Motif 




447-MBP(11.5 mM) + IR 


195 


hGH Pept. Fus. 




MBP(13gM)+IR 


209 


Negative Control 



20 

The A7 (20A4), D8, and D10 peptide sequences are shown in Figures 8 and 9A-9B. The 447
peptide sequence is: LCQRLGVGWPGWLSGWCA (SEQ ID NO:2156).

TABLE 13

BIAcore Results - Synthetic peptides compete for binding to IR

Incubation Mix   % Binding   Result (RUs)   Sequence Type
IR               100         128            Positive control
IR + 20D1        41          51.8           Formula 1 Motif
IR + D8          33          41.6           Formula 6 Motif
IR + 20C11       38          49             Formula 2 Motif (bkg high)
IR + H2          27          34.6           IGF (phosphorylated band)
IR + 2A9         100         128            IGF (bkg high)
IR + 20A4        33          41.8           Formula 6 Motif
IR + p53wt       97          124.5          P53 wild type

The concentration of each peptide was about 40 pM and the concentration of IR
was 450 nM. The 20D1, 20A4, and D8 peptide sequences are shown in Figures 8 and
9A-9B. The remaining peptide sequences are as follows: 447 = LCQRLGVGWPGWLSGWCA
(SEQ ID NO:2156); 2A9 = LCQSWGVRIGWLTGLCP (SEQ ID NO:2157); 20C11 =
DRAFYNGLRDLVGAVYGAWD (SEQ ID NO:1659); H2 = VTFTSAVFHENFYDWFVRQVS
(SEQ ID NO:1784).
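The % Binding column of Table 13 appears to be each response divided by the response of IR alone (128 RU); the tabulated percentages agree with that calculation to within about one percentage point. The following Python sketch illustrates the arithmetic. The normalization is inferred from the tabulated values rather than stated in the text, and the function name percent_binding is hypothetical.

```python
# Sketch: residual binding relative to the IR-only control (Table 13).
# Assumption: % Binding = 100 * RU(sample) / RU(IR alone). This is inferred
# from the table, not stated in the text. Names are illustrative only.

IR_ALONE_RU = 128.0  # response of IR without competitor (Table 13)

# Responses (RU) for IR pre-incubated with each peptide (Table 13).
responses_ru = {
    "20D1": 51.8,
    "D8": 41.6,
    "20C11": 49.0,
    "H2": 34.6,
    "2A9": 128.0,
    "20A4": 41.8,
    "p53wt": 124.5,
}


def percent_binding(sample_ru: float, control_ru: float = IR_ALONE_RU) -> float:
    """Return the residual binding as a percentage of the control response."""
    return 100.0 * sample_ru / control_ru


if __name__ == "__main__":
    for peptide, ru in responses_ru.items():
        print(f"IR + {peptide}: {percent_binding(ru):.1f}% binding")
```

A lower percentage corresponds to stronger competition for the insulin-binding site, so H2 is the strongest competitor in this set and 2A9 and p53wt show essentially no competition.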

Regarding preparation of a Site 1 agonist comprising two D117 (H2C)
peptides, a linker of only 3 amino acids (12 Å) provided a ligand of greater
affinity for Site 1 of IR than a corresponding ligand prepared with a 9 amino
acid (36 Å) linking region (Figure 29).



F. Stimulation of Autophosphorylation of IR by MBP-Fusion
Peptides

MBP fusion peptides were prepared as described above, and then
assayed for autophosphorylation of an insulin receptor construct transfected
into 32D cells (Wang et al., 1993; clone 969) (see Example, above). The
results of these experiments, shown in Figure 30, indicate that the H2C
monomer and H2C-H2C homodimer peptides stimulate autophosphorylation
of IR in vivo. H2C dimer peptides (Site 1-Site 1) with a 6 amino acid linker
(H2C-6aa-H2C) were most active in the autophosphorylation assay. Other
active dimer peptides are also shown in Figure 30, particularly H2C-9aa-H2C,
H2C-12aa-H2C, H2C-3aa-H2C, and F8.

G. Insulin Receptor Binding Affinity and Fat Cell Potency of
MBP-Fusion Peptides

Results of assays to determine binding affinity for insulin receptor and
fat cell potency of the MBP-fusion peptides are shown in Table 14, below.

TABLE 14

(Rotated table; the column layout and the binding affinity and fat cell potency values are not recoverable from the extracted text. The legible MBP-fusion construct sequences and peptide names are listed below.)

MBP...NNNNLGIEGRISEFIEGR AQPAMA WLDQEWAWVQCEVYGRGCPS AAA(ETAG)AA
MBP...NNNNLGIEGRISEFIEGRDYKDDDDK HLCVLEELFWGASLFGYCSGAAA(ETAG)AA
MBP...NNNNLGIEGRISEFIEGRDYKDDDDK HLCVLEELFWGASLFGYCSGGGSGGSGGSGGS HLCVLEELFWGASLFGYCSGAAA(ETAG)AA
MBP...NNNNLGIEGRISEFIEGRDYKDDDDK HLCVLEELFWGASLFGYCSGGGSGGS HLCVLEELFWGASLFGYCSGGGSGGS HLCVLEELFWGASLFGYCSGAAA(ETAG)AA
MBP...NNNNLGIEGRISEFEGRAQPAMA FHENFYDWFVRQVSAAA(ETAG)AA
MBP...NNNNLGIEGRISEFEGRAQPAMAFHENFYDWFVRQVSGGSFHENFYDWFVRQVSAAA(ETAG)AA
MBP...NNNNLGEGRISEFEGRAQPAMAFHENFYDWFVRQVSGGSGGSGGSFHENFYDWFVRQVSAAA(ETAG)AA
MBP...NNNNLGEGRISEFIEGRDYKDDDDKVRVDWLQRNANFYAWFVAELGGGSGGSGGSGGSGRVDWLQRNANFYDWFVAELGAAA(ETAG)AA
MBP...NNNNLGIEGRISEFIEVRAQPAMA RGGGTFYEWFESALRKHGAGGGSGGSGGSGGSRGGGTFYEWFESALRKHGAGAAA(ETAG)AA

Peptide names legible in the table: RB505M, RB517M, RB508M, RB509M, RB513.

WO 03/070747 PCT/US02/30312 


Example 9: In Vivo Assays for Insulin Agonists 

To test the in vivo activity of dimer peptide S519, an intravenous
blood glucose test was carried out on Wistar rats. Male Mol:Wistar rats,
weighing about 300 g, were divided into two groups. A 10 µl sample of
blood was taken from the tail vein for determination of blood glucose
concentration. The rats were anaesthetized with Hypnorm/Dormicum at
t = -30 min and blood glucose was measured again at t = -20 min and at t = 0
min. After the t = 0 sample was taken, the rats were injected into the tail
vein with vehicle or test substance in an isotonic aqueous buffer at a
concentration corresponding to a 1 ml/kg volume of injection. Blood glucose
was measured at times 10, 20, 30, 40, 60, 80, 120, and 180 min. The
Hypnorm/Dormicum administration was repeated at 20 minute intervals.
Results shown in Figure 33 demonstrate that the S519 peptide (at 20 nmol/kg)
lowered blood glucose to levels similar to those observed for human
insulin (at 2.5 nmol/kg) (n=8). The S519 peptide and human insulin showed
comparable in vivo effects, both in magnitude and onset of response (Figure
33).

Example 10: IGF-1 Surrogates 

Three major groups of peptide IGF-1 surrogates were obtained from
IGF-1R panning experiments: Site 1 A6 (FyxWF) (SEQ ID NO: 1596); Site 1
B6 (FyxxLxxL) (SEQ ID NO: 1732); and Site 2 (C-C looped). See Beasley
et al., International Application PCT/US00/08528, filed March 29, 2000, and
Beasley et al., U.S. Application Serial No. 09/538,038, filed March 29, 2000.
Active surrogates included 20E2 and RP6 (B6-like; Formula 2), S175 (A6-like;
Formula 1), G33 (A6-like; Formula 1), RP9 (A6-like; Formula 1), D815
(Site 2), and D8B12 (Site 2) peptides. The IGF-1 surrogates were analyzed
by various assays, described as follows.



A. Phage Competition

Phage competition studies were performed with Site 1 (RP9) and Site
2 (D815) monomer peptides. Plates were coated with IGF-1R (100 ng/well
in carbonate buffer, pH 9.6) overnight at 4°C. Wells were blocked with 4%
non-fat milk in PBS for 60 min at room temperature. One hundred
microliters of rescued phage were added to each well. Peptides in varying
concentrations were added and the mixtures were incubated for 2 hr at room
temperature. Plates were washed three times with PBS and 100 µl of
anti-M13 antibody conjugated to horseradish peroxidase was added to each
well. The labeled antibody was incubated at room temperature for 60 min.
After washing, 100 µl of ABTS was added per well and the plates were read
in a microtiter reader at 450 nm.

Phage included RP9 (A6-like; Formula 1); RP6 (B6-like; Formula 2); 
D8B12 (Site 2); and D815 (Site 2). Peptides included RP9 and D815. 

Peptide   Formula   Site IGF-1R   Sequence                    SEQ ID NO:
D8B12     6         2             WLEQERAWIWCEIQGSGCRA        1884
D815      6         2             WLDQERAWLWCEISGRGCLS        2206
RP6       2         1             TFYSCLASLLTGTPQPNRGPWERCR   1635
RP9       1         1             GSLDESFYDWFERQLG            1559



Results shown in Figures 34A-34E demonstrate that the RP9 and
D815 peptides competed with both Site 1 and Site 2 phage. These results
illustrate the allosteric nature of the interaction with IGF-1R.

Phage competition studies were also performed with Site 2-Site 1
dimer peptides containing 6- or 12-amino acid linkers. Plates were coated
with IGF-1R (100 ng/well in carbonate buffer, pH 9.6) overnight at 4°C.
Wells were blocked with 4% non-fat milk in PBS for 60 min at room
temperature. One hundred microliters of rescued phage were added to
each well. Peptides in varying concentrations were added and the mixture was
incubated for 2 hr at room temperature. Plates were washed three times
with PBS and 100 µl of anti-M13 antibody conjugated to horseradish
peroxidase was added to each well. The labeled antibody was incubated for
60 min at room temperature. After washing, 100 µl of ABTS was added per
well and the plates were read in a microtiter reader at 450 nm. Phage included
RP9, RP6, D8B12, and D815. Peptides included D815-6L-RP9 and D815-12L-RP9.
Linker sequences are underlined and shown below.



Peptide        Formula   Site IGF-1R   Sequence                                             SEQ ID NO:
D815-6L-RP9    6-1       2-1           LDQERAWLWCEISGRGCLSGGSGGSGSLDESFYDWFERQLGKK          2207
D815-12L-RP9   6-1       2-1           WLDQERAWLWCEISGRGCLSGGSGGSGGSGGSGSLDESFYDWFERQLGKK   2208



D8B12, D815, RP6, and RP9 amino acid sequences are shown in the
previous section. Results shown in Figures 35A-35E demonstrate that the
dimers competed with both Site 1 and Site 2 phage. This indicates that both
dimer units were active at IGF-1R.

B. IGF-1 Proliferation Assays 

FDCP-2 cells expressing the IL-3 and human IGF-1R receptors were
grown in RPMI-1640 medium supplemented with 15% fetal bovine serum
(FBS) and 5% WEHI conditioned medium (containing IL-3) in accordance
with routine methods. Prior to an experiment, the cells were pelleted and
washed two times in PBS. Following this, cells were resuspended in
RPMI-1640 medium with 2% FBS and added to a 96-well plate at a concentration
of 2 x 10^4 cells/well in 75 µl. This was designated as the cell plate.

Peptides were suspended in RPMI-15% FBS (test medium). For the
agonist assay, medium was added to rows 2-12 of a 96-well plate. The
peptide was added to row 1 in 200 µl of test medium at a final concentration
of 60 µM. The peptide was serially diluted (1:1) across rows 2-11. No
peptide was added to row 12 (control; cells without IGF-1). For the
antagonist assay, test medium containing 10 ng/ml IGF-1 (ED50 test
medium) was added to all wells of a 96-well plate. To row 1 was added 100 µl
of peptide in ED50 test medium at a concentration of 120 µM. The peptide
was serially diluted (1:1) across rows 2-11. No peptide was added to row 12
(control; cells with IGF-1).

For both agonist and antagonist assays, 75 µl from the working plates
was transferred to the appropriate rows in comparable cell plates. The
starting peptide concentration for both agonist and antagonist assays was
30 µM. Each peptide was assayed in duplicate. Plates were incubated at 37°C
for 45-48 hr. Ten microliters of WST-1 (Cell Proliferation Reagent, Roche
cat # 1 644 807) were added to each well and the plates were read in an
ELISA reader (440/700 dual wavelength) each hour for 4 hr. Graphs were
prepared from the raw data using Sigma Plot. Peptides included:



Peptide   Formula   Site IGF-1R   Sequence                     SEQ ID NO:
20E2      2         1             DYKDFYDAIDQLVRGSARAGGTRD     2209
D815      6         2             WLDQERAWLWCEISGRGCLS         2206
G33       1         1             GIISQSCPESFYDWFAGQVSDPWWCW   1600
RP6       2         1             TFYSCLASLLTGTPQPNRGPWERCR    1635
RP9       1         1             GSLDESFYDWFERQLG             1559
S175      1         1             GRVDWLQRNANFYDWFVAELG        1560
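The 1:1 serial dilution described above for the agonist working plate can be sketched as follows. This is an illustration only: it assumes that each 1:1 step halves the concentration, that the dilution series occupies rows 1-11, and that transferring 75 µl onto 75 µl of cell suspension halves the concentration again, which is consistent with the stated 60 µM working and 30 µM starting concentrations. The function names are hypothetical.

```python
# Sketch: concentrations produced by a 1:1 (two-fold) serial dilution.
# Assumptions: 11 dilution positions (rows 1-11), row 12 left without peptide,
# and a further two-fold dilution when 75 µl is mixed with 75 µl of cells.


def serial_dilution(start_um: float, positions: int = 11, factor: float = 2.0) -> list[float]:
    """Concentration (µM) at each position of a serial dilution."""
    return [start_um / factor**i for i in range(positions)]


def final_in_cell_plate(working_um: float, transfer_ul: float = 75.0, cells_ul: float = 75.0) -> float:
    """Concentration after mixing the transferred volume with the cell suspension."""
    return working_um * transfer_ul / (transfer_ul + cells_ul)


if __name__ == "__main__":
    working = serial_dilution(60.0)  # agonist working plate starts at 60 µM
    for row, conc in enumerate(working, start=1):
        print(f"row {row:2d}: working {conc:8.4f} µM -> final {final_in_cell_plate(conc):8.4f} µM")
```

Under these assumptions the final concentrations run from 30 µM in row 1 down to roughly 0.03 µM in row 11.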

Results of the IGF-1 proliferation assays are shown in Figures 36-42.
Figure 36 demonstrates that peptides G33 (Site 1; ED50 ~ 10 µM) and
D815 (Site 2; ED50 ~ 2 µM) showed agonist activity at IGF-1R, whereas
peptides RP9 and RP6 showed no agonist activity. Figure 37 demonstrates
that peptides RP6 (Site 1; ED50 ~ 1 µM) and RP9 (Site 1; ED50 ~ 7 µM)
showed antagonist activity at IGF-1R, whereas peptides G33 and D815
showed no antagonist activity. Figure 38 demonstrates that peptides S175
and 20E2 exhibited weak agonist activity at IGF-1R (ED50 > 10 µM). Figure
39 shows that D815-RP9 dimers with 6- or 12-amino acid linkers acted as
agonists at IGF-1R. Figure 40 shows that dimer peptide D815-6-G33 was
inactive as an agonist at IGF-1R. Figure 41 shows that monomer peptide
RP6 acted as an antagonist at IGF-1R. The IGF-1 standard curve
determined for FDCP-2 cells is shown in Figure 42.

The IGF-1R data for the Site 1 and Site 2 peptides are summarized in
Table 15, below.

Example 11: Panning Peptide Libraries

A. Panning IGF-1 Surrogate Secondary Libraries

Soluble IGF-1R ("sIGF-1R") was obtained from R&D Systems. The
soluble protein (> 95% pure) included the heterotetrameric (alpha 2-beta 2)
extracellular domain of IGF-1R isolated from a mouse myeloma cell line.
sIGF-1R (500 ng/well) was added to an appropriate number of wells in a
96-well microtiter plate (MaxiSorp plates, NUNC) and incubated overnight at
4°C. Wells were then blocked with MPBS (PBS buffer pH 7.5 containing 2%
Carnation® non-fat dry milk) at room temperature (RT) for 1 h. Eight wells
were used for each round of panning for the G33 and RP6 secondary
libraries. The phage were incubated with MPBS for 30 min at RT, then 100
µl was added to each well.

For the first round, the input phage titer was 4 x 10^13 cfu/ml. For
rounds 2 and 3, the input phage titer was approximately 10^11 cfu/ml. Phage
were allowed to bind for 2 to 3 h at RT. The wells were then quickly washed
13 times with 200 µl/well of MPBS. Bound phage were eluted by incubation
with 100 µl/well of 20 mM glycine-HCl, pH 2.2 for 30 s. The resulting
solution was then neutralized with Tris-HCl, pH 8.0. Log phase TG1 cells
were infected with the eluted phage, then plated onto two 24 cm x 24 cm
plates containing 2xYT-AG. The plates were incubated at 30°C overnight.
The next morning, cells were removed by scraping and stored in 10%
glycerol at -80°C. For subsequent rounds of affinity enrichment, cells from
these frozen stocks were grown and phage were prepared as described
above. A minimum of 72 clones was picked at random from the second,
third, and fourth rounds of panning and screened for binding activity. DNA
sequencing of the clones determined the amino acid sequences
summarized in Figures 43A-43B.

B. Panning Peptide Dimer Libraries

Microtiter plates were coated and blocked by standard methods, as
follows. Plates were coated with sIGF-1R (see Example, above) or soluble
IR (Bass construct; Bass et al., 1996, J. Biol. Chem. 271:19367-19375) in
0.2 M NaHCO3, pH 9.4. One hundred microliters of solution containing
either 50 ng IR or IGF-1R (rounds 1 and 2), 25 ng IR or IGF-1R (round 3), or
12.5 ng IR or IGF-1R (round 4) was added to an appropriate number of
wells in a 96-well microtiter plate (MaxiSorp plates, Nalge NUNC) and
incubated overnight at 4°C. Wells were then blocked with a solution of 2%
non-fat milk in PBS (MPBS) at RT for at least 1 h.

Eight wells coated with IR or IGF-1R were used for each round of
panning. One hundred microliters of phage were added to each well. For
the first round, the input phage titer was 3 x 10^13 cfu/ml. For subsequent
rounds, the input phage titer was approximately 10^12 cfu/ml. Phage were
incubated for 2-3 h at RT. The wells were then quickly washed 13 times
with 300 µl/well of PBS. Bound phage were eluted by incubation with 150
µl/well of 50 mM glycine-HCl, pH 2.0 for 15 min. The resulting solution was
pooled and then neutralized with Tris-HCl, pH 8.0. Log phase TG1 cells
were infected with the eluted phage in 2xYT medium for 1 hr at 37°C prior
to the addition of helper phage, ampicillin, and glucose (2% final
concentration).

After incubation for 1 hr at 37°C, the cells were spun down and
resuspended in 2xYT-AK medium. The cells were then returned to the
shaker and incubated overnight at 37°C. Phage amplified overnight were
then precipitated and subjected to the next round of panning. A total of 96
clones were picked at random from rounds 3 and 4 and screened for binding
activity. Several clones from each pan were further tested for binding to IR
or IGF-1R in phage ELISA by competition with soluble peptides as
described in Beasley et al., International Application PCT/US00/08528, filed
March 29, 2000, and Beasley et al., U.S. Application Serial No. 09/538,038,
filed March 29, 2000. Competition was performed by addition of 5 µl of RP9
peptide, recombinant D8 peptide, or both per well, followed by addition of
100 µl of phage per well. Representative peptides are shown in Figures
44A-44B and in Table 16, below.

C. Determination of Amino Acid Preferences

For both monomer and dimer peptides, amino acid preferences for
each peptide were determined position by position as follows. The expected
frequency of each of the 20 amino acids at a given position was calculated
based on codon usage and % doping for that library. This was then compared
to the actual frequency of occurrence of each amino acid at every position
after four rounds of biopanning. Any amino acid that occurred at more than
2-fold its expected frequency was considered preferred. The most preferred
amino acid(s) were those with the greatest fold enrichment after panning.
Preferred amino acid sequences for RP9, D8, and Formula 10 (Group 6)
peptides are shown below.
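The comparison of observed and expected frequencies described above can be sketched in Python as follows. This is a minimal illustration, not the procedure actually used: the expected frequencies, the clone counts, and all function names are hypothetical, and the cut-off is taken to mean an observed/expected ratio greater than 2.

```python
# Sketch: flag preferred amino acids at one library position.
# Assumptions: expected frequencies (from codon usage and doping) are supplied
# by the caller, observed counts come from sequenced clones, and "preferred"
# means observed/expected > 2. All names and numbers are illustrative.

from collections import Counter


def fold_enrichment(observed: Counter, expected_freq: dict[str, float]) -> dict[str, float]:
    """Observed frequency divided by expected frequency for each residue."""
    total = sum(observed.values())
    return {
        aa: (observed.get(aa, 0) / total) / freq
        for aa, freq in expected_freq.items()
        if freq > 0
    }


def preferred(observed: Counter, expected_freq: dict[str, float], threshold: float = 2.0):
    """Return (preferred residues sorted by enrichment, most preferred residue)."""
    folds = fold_enrichment(observed, expected_freq)
    hits = sorted((aa for aa, f in folds.items() if f > threshold), key=lambda aa: -folds[aa])
    return hits, (hits[0] if hits else None)


if __name__ == "__main__":
    # Toy example: residues seen at one position in 20 hypothetical clones.
    counts = Counter({"F": 12, "Y": 5, "L": 2, "S": 1})
    expected = {"F": 0.10, "Y": 0.10, "L": 0.20, "A": 0.30, "S": 0.30}
    print(preferred(counts, expected))
```

In the toy data F and Y exceed the 2-fold cut-off, and F, with the greatest fold enrichment, is the most preferred residue.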



TABLE 17

Peptide      Sequence           SEQ ID NO:
RP9          GSLDESFYDWFERQLG   1559
Regular      GLADEDFYEWFERQLR   2223
             L
w/ Peptide   GQLDEDFYEWFDRQLS   2224
             A
w/ Insulin   GFMDESFYEWFERQLR   2225
             W A



Table 17 shows preferred amino acid sequences for RP9 peptides.
Residues in bold indicate strong preference; underlined residues indicate
positions where more than one amino acid preference is seen. The first
column indicates the conditions used for the panning procedure. "RP9"
indicates the sequence of the parent RP9; "Regular" indicates a regular pan as
described in the methods for panning of random libraries; "w/ peptide" indicates
panning in the presence of 2 nM RP9 peptide; "w/ insulin" indicates panning
in the presence of 2 nM insulin.

TABLE 18

Peptide             Sequence                 SEQ ID NO:
D8 Parent:          WLDQEWAWVQCEVYGRGCPS     2129
Dimer Consensus     sLEEEWaQIECEVY/WGRGCps   2226
Monomer Consensus   sLEEEWaQIqCEIY/WGRGCry   1548
                    W



Table 18 shows preferred amino acid sequences for D8 peptides.
Upper case residues in bold indicate strong preference (>90% frequency);
upper case letters, non-bold, indicate some preference (5-15% higher
frequency than expected); lower case letters indicate less preference (2-5%
higher frequency than expected). Similar preferences were seen for D8 in both
monomer and dimer libraries. The underlined Y/W indicates that both
residues are equally preferred at that position. In the original D8 sequence
that position is occupied by Y.



TABLE 19

Peptide   Sequence          Type                 SEQ ID NO:
Group 6   W(A/E)GYEW(F/L)   preferred core       1549
Group 6   DSDWAGYEWFEEQLD   preferred sequence   1595


Table 19 shows preferred amino acid sequences for Group 6
peptides. Underlined residues indicate preferred N-terminal and C-terminal
extensions.

Example 12: Fluorescence-Based hIGF-1R Binding Assays

A. Heterogeneous Time-Resolved Fluorometric Assays 

The effect of recombinant peptide surrogate G33 (rG33) on the
binding of biotinylated-recombinant human IGF-1 (b-rhIGF-1) to recombinant
human IGF-1R (rhIGF-1R) was determined using heterogeneous time-resolved
fluorometric assays (TRF; DELFIA®, PE Wallac, Inc.). The rhIGF-1R
protein included the extracellular domain of the receptor pre-propeptide,
up to amino acid residue 932 (A. Ullrich et al., 1986, EMBO J. 5:2503-2512).
Duplicate data points were collected at each concentration of competitor and
the lines represent the best fit to a four-parameter non-linear
regression analysis (y = min + (max-min)/(1+10^((logIC50-x)*Hillslope)))
of the data, which was used to determine IC50 values.
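The four-parameter equation given above can be evaluated directly. The Python sketch below only computes the model curve; it does not perform the regression, and the parameter values are illustrative rather than taken from the experiments. It assumes x is the base-10 logarithm of the competitor concentration in molar units.

```python
# Sketch: y = min + (max - min) / (1 + 10^((logIC50 - x) * Hillslope)),
# with x = log10 of the competitor concentration (M).
# Parameter values below are illustrative, not experimental results.

import math


def four_param_logistic(x: float, y_min: float, y_max: float, log_ic50: float, hill_slope: float) -> float:
    """Predicted response at log10 concentration x."""
    return y_min + (y_max - y_min) / (1.0 + 10.0 ** ((log_ic50 - x) * hill_slope))


if __name__ == "__main__":
    # In a competition assay the signal falls as competitor increases,
    # so the Hill slope is negative in this form of the equation.
    params = dict(y_min=0.0, y_max=100.0, log_ic50=math.log10(5e-8), hill_slope=-1.0)
    for conc in (1e-10, 1e-9, 1e-8, 5e-8, 1e-7, 1e-6, 1e-5):
        y = four_param_logistic(math.log10(conc), **params)
        print(f"{conc:.0e} M -> {y:6.2f} % specific binding")
```

At x = logIC50 the exponent is zero, so the predicted response is exactly halfway between min and max, which is what defines the IC50 in this model.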

The assay was performed using a 96-well clear microplate (NUNC
MaxiSorp) with a final volume of 100 µl. Microtiter plates were coated with
0.1 µg rhIGF-1R in 100 µl of NaHCO3 buffer, pH 8.5, and incubated
overnight at room temperature (RT). The plates were washed 3 times with
0.05 M Tris-HCl (pH 8 at 25°C), 0.138 M NaCl, 0.0027 M KCl (TBS). This
was followed by addition of 200 µl blocking buffer (TBS containing 0.05%
Bovine Serum Albumin (BSA, Cohn Fraction V)) and incubation for 1 hr at
RT. The plates were washed 6 times with a 1X solution of Wallac's
DELFIA® wash concentrate. Competitor was added in a volume of 50 µl and
serially diluted across the microtiter plate in TBS containing 0.05% BSA.
Non-specific binding (background) was determined in the presence of 60 µM
hIGF-1.

Fifty microliters of b-rhIGF-1, 10 nM, diluted in TBS containing 0.05%
BSA was added. The plates were incubated for 2 hr at RT. After
incubation, plates were washed 6 times with a 1X solution of Wallac's
DELFIA® wash concentrate. Then the plates were treated with 100 µl of
Wallac's DELFIA® Assay Buffer containing a 1:1000 dilution of
europium-labeled streptavidin and incubated for 2 hours at RT. This was
followed by washing 6 times with a 1X solution of Wallac's DELFIA® wash
concentrate. One hundred microliters of Wallac's DELFIA® enhancer was added,
and the plates were shaken for 30 min at RT. After shaking, the fluorescence
signal at 620 nm was read on a Victor² 1420 plate reader (PE Wallac, Inc.).
Primary data were background corrected, normalized to buffer controls, and
then expressed as % Specific Binding. The Z'-factor was greater than 0.5
(Z' = 1-(3σ+ + 3σ-)/|µ+ - µ-|; Zhang et al., 1999, J. Biomol. Screen. 4:67-73) and
the signal-to-background (S/B) ratio was ~20. The results of these
experiments are shown in Figure 45. The IC50 value calculated for rG33 is
shown in Table 20, below.
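The data reduction and quality metrics mentioned above (% Specific Binding, Z'-factor, and S/B ratio) can be illustrated with the following Python sketch. The well readings are invented for illustration, the function names are hypothetical, and the use of the sample standard deviation is an assumption, since the text does not say which estimator was used.

```python
# Sketch: % specific binding, Z'-factor, and signal-to-background ratio.
# Assumptions: "high" wells are buffer controls (total binding), "low" wells
# are background (non-specific binding); readings are invented; the sample
# standard deviation is used.

import statistics


def percent_specific_binding(signal: float, background: float, buffer_control: float) -> float:
    """Background-correct a reading and express it relative to the buffer control."""
    return 100.0 * (signal - background) / (buffer_control - background)


def z_prime(high: list[float], low: list[float]) -> float:
    """Z' = 1 - (3*sd_high + 3*sd_low) / |mean_high - mean_low|."""
    spread = 3.0 * statistics.stdev(high) + 3.0 * statistics.stdev(low)
    return 1.0 - spread / abs(statistics.mean(high) - statistics.mean(low))


def signal_to_background(high: list[float], low: list[float]) -> float:
    """Ratio of the mean high-control signal to the mean background signal."""
    return statistics.mean(high) / statistics.mean(low)


if __name__ == "__main__":
    buffer_wells = [2000.0, 2100.0, 1900.0, 2050.0, 1950.0]
    background_wells = [100.0, 110.0, 90.0, 105.0, 95.0]
    print(f"Z' = {z_prime(buffer_wells, background_wells):.3f}")
    print(f"S/B = {signal_to_background(buffer_wells, background_wells):.1f}")
    print(f"% specific binding at 1050 counts = {percent_specific_binding(1050.0, 100.0, 2000.0):.1f}")
```

A Z' value above 0.5, as reported for these assays, indicates that the separation between the control means is large relative to their combined variability.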

The effect of recombinant peptide surrogates D815 (rD815), RP9,
D815-6aa-G33, D815-6aa-RP9, and D815-12aa-RP9 on the binding of
b-rhIGF-1 to rhIGF-1R was determined using the fluorometric assay described
above. IGF-1 was used as a control. Duplicate data points were collected
at each concentration of competitor and the lines represent the best fit to a
four-parameter non-linear regression analysis, which was used to determine
IC50 values. Results for rD815 are shown in Figure 46; results for RP9 are
shown in Figure 47; results for D815-6-G33 are shown in Figure 48; results
for D815-6-RP9 are shown in Figure 49; results for D815-12-RP9 are
shown in Figure 50; and results for IGF-1 are shown in Figure 51. The IC50
values for the rD815, RP9, D815-6aa-G33, D815-6aa-RP9, and D815-12aa-RP9
peptides, and IGF-1 are shown in Table 20, below. Linker sequences
are underlined.

TABLE 20

Competitor      Sequence                                             SEQ ID NO:   IC50 (M)
rG33            GIISQSCPESFYDWFAGQVSDPWWCW                           1600         1.45 x 10^? M
rD815           WLDQERAWLWCEISGRGCLS                                 2206         4.08 x 10^? M
RP9             GSLDESFYDWFERQLG                                     1559         4.17 x 10^? M
D815-6aa-G33    WLDQERAWLWCEISGRGCLSGGSGGSGIISQSCPESFYDWFAGQVSDPWWCW   2210       6.24 x 10^? M
D815-6aa-RP9    WLDQERAWLWCEISGRGCLSGGSGGSGSLDESFYDWFERQLGKK         2211         3.57 x 10^? M
D815-12aa-RP9   WLDQERAWLWCEISGRGCLSGGSGGSGGSGGSGSLDESFYDWFERQLGKK   2212         3.22 x 10^? M
IGF-1                                                                             6.85 x 10^-10 M

The order of potency of all peptides or dimers compared to IGF-1 was
determined as: IGF-1 > D815-12aa-RP9 >> D815-6aa-RP9 > RP9 = D815-6aa-G33
> rG33 > rD815. These results suggest that the coupling of D815
with RP9 using an extended linker (12 versus 6 amino acids) produced a
potent competitor that approximates the affinity of IGF-1 for its own receptor.

B. Time-Resolved Fluorescence Resonance Energy Transfer 
Assays 

The effect of Site 1 peptide surrogates, Site 2 peptide surrogates, and
rhIGF-1 on the dissociation of biotinylated-20E2 (b-20E2, Site 1) from
recombinant human IGF-1R was determined using time-resolved
fluorescence resonance energy transfer assays (TR-FRET). Best-fit
non-linear regression analysis of the data was used to determine dissociation
rate constants. Each data point represents a single observation.

The assay was performed using a 96-well white microplate (NUNC)
with a final volume of 100 µl. Final incubation conditions were 16.5 nM
b-20E2, 2.2 nM SA-APC (streptavidin-allophycocyanin), 2.2 nM Eu3+-rhIGF-1R
(LANCE™ labeled, PE Wallac, Inc.), 0.05 M Tris-HCl (pH 8 at 25°C), 0.138
M NaCl, 0.0027 M KCl, and 0.1% BSA (Cohn Fraction V). Reactions were
allowed to reach equilibrium for 6 hr at RT. Following this, various peptide
surrogates or IGF-1 were added at a final concentration of 100 µM or 30 µM,
respectively. The addition of peptides or IGF-1 initiated the measurement of
dissociation (Time Zero, sec). The fluorescence signal at 665 nm was read
on a Victor² 1420 plate reader (PE Wallac, Inc.) at 30 sec intervals.
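The text does not state which model was fitted to obtain the dissociation rate constants. As a hedged illustration only, the Python sketch below assumes a single-phase exponential decay to zero, signal(t) = S0 * exp(-k_off * t), and estimates k_off from the slope of ln(signal) against time using synthetic, noise-free readings taken at 30 sec intervals. Real data with a non-zero plateau would call for a non-linear fit instead. All names and numbers are hypothetical.

```python
# Sketch: estimating a dissociation rate constant from a time course.
# Assumption (not stated in the text): single-phase exponential decay to zero.
# The data are synthetic; names are illustrative.

import math


def fit_k_off(times_s: list[float], signals: list[float]) -> float:
    """Least-squares slope of ln(signal) versus time; returns k_off in s^-1."""
    logs = [math.log(s) for s in signals]
    n = len(times_s)
    mean_t = sum(times_s) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_s, logs))
    return -sxy / sxx


if __name__ == "__main__":
    true_k = 0.002  # s^-1, arbitrary value for the synthetic example
    times = [30.0 * i for i in range(20)]  # one reading every 30 sec
    signals = [1000.0 * math.exp(-true_k * t) for t in times]
    k = fit_k_off(times, signals)
    print(f"k_off = {k:.4f} s^-1, half-life = {math.log(2) / k:.0f} s")
```

Comparing k_off values obtained in the presence of different unlabeled competitors is one way to express the differences in dissociation rate discussed in this section.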

Results of these experiments are shown in Figure 52. The buffer
controls did not vary over the time interval of study, which demonstrated that
the equilibrium was not disturbed by the addition of diluent at Time Zero.
The effect of adding excess (> 1000-fold the 20E2 Kd for IGF-1R) Site 1
peptide such as H2C, 20E2, or RP6 did not differ depending on the specific
peptide used, and the dissociation rates of b-20E2 were similar for these
peptides. D8B12 (Site 2 peptide) and IGF-1 (binds both Site 1 and Site 2) did
demonstrate significant differences in the rate of dissociation of b-20E2.
This would suggest that these agents act as non-competitive or allosteric
regulators of Site 1 binding.

The effect of various peptide surrogates or peptide dimers on the
binding of biotinylated-20E2 (b-20E2) to recombinant human IGF-1R was
determined using TR-FRET assays, described above. For these
experiments, each data point represents the average of two replicate wells.
The lines represent the best fit to a four-parameter non-linear regression
analysis (y = min + (max - min)/(1 + 10^((logIC50 - x) * Hillslope))) of the
data, which was used to determine IC50 values.
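
The four-parameter equation quoted above can be written out as a small function. This is a minimal sketch of the model only, not of the fitting procedure; the parameter values below are hypothetical and chosen purely for illustration (x is taken to be log10 of the molar competitor concentration, and a negative Hill slope is assumed so that binding falls as competitor rises).

```python
def four_param_logistic(x, y_min, y_max, log_ic50, hill_slope):
    """y = min + (max - min) / (1 + 10 ** ((logIC50 - x) * Hillslope))."""
    return y_min + (y_max - y_min) / (1.0 + 10.0 ** ((log_ic50 - x) * hill_slope))


# Hypothetical parameters (not fitted values from the assays):
# 0-100 % Specific Binding, IC50 = 1e-8 M, Hill slope = -1.
params = dict(y_min=0.0, y_max=100.0, log_ic50=-8.0, hill_slope=-1.0)

for log_conc in (-10.0, -9.0, -8.0, -7.0, -6.0):
    pct = four_param_logistic(log_conc, **params)
    print(f"log10[competitor] = {log_conc:5.1f} -> {pct:6.2f} % specific binding")
```

At x = logIC50 the exponent is zero, so the curve passes through the midpoint between min and max, which is what makes the fitted logIC50 directly interpretable.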

The assays were performed using a 384-well white microplate
(NUNC) with a final volume of 30 µl. Final incubation conditions were 15 nM
b-20E2, 2 nM SA-APC, 2 nM Eu3+-rhIGF-1R (LANCE™ labeled, PE Wallac,
Inc.), 0.05 M Tris-HCl (pH 8 at 25°C), 0.138 M NaCl, 0.0027 M KCl, and 0.1
% BSA (Cohn Fraction V). After 16-24 hr of incubation at RT, the
fluorescence signal at 665 nm and 620 nm was read on a Victor² 1420 plate
reader (PE Wallac, Inc.). Primary data were background corrected,
normalized to buffer controls, and then expressed as % Specific Binding.
The Z'-factor was greater than 0.5 (Z' = 1 - (3σ+ + 3σ-)/|μ+ - μ-|; Zhang et
al., 1999, J. Biomol. Screen. 4:67-73) and the signal-to-background (S/B)
ratio was ~4. Results of these experiments are shown in Figure 53. Table 21,
below, shows the IC50 values calculated for these experiments. Notably, the
C1 peptide showed IGF-1R affinities of ~1 nM (Figure 53) and ~10 nM
(Table 21) in these assays.

TABLE 21

Competitor  Sequence                  SEQ ID NO:  Formula  Site IGF-1R  IC50 (M)
C1          CWARPCGDAANFYDWFVQQAS     1550        1        1            8.80E-10
IGF-1                                                                   2.93E-09
RP9         GSLDESFYDWFERQLG          1559        1        1            3.93E-08
20E2        DYKDFYDAIDQLVRGSARAGGTRD  2209        2        1            1.04E-07
E8          GGTVWPGYEWLRNA            2118        10       2            2.53E-07
H2C         FHENFYDWFVQRVSKK          2117        1        1            4.60E-07
S173        LDALDRLMRYFEERPSL         1830        3        1            6.29E-06
D8B12       WLEQERAWIWCEIQGSGCRA      1884        6        2            1.13E-05
A6          SAKNFYDWFVKK              1551        1        1            3.10E-05
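
The assay-quality figures quoted for the TR-FRET assay (Z'-factor greater than 0.5, S/B of about 4) follow from simple statistics on control wells. The sketch below shows the calculation using the Z' formula cited from Zhang et al.; the well readings are hypothetical numbers chosen for illustration, not data from these experiments.

```python
from statistics import mean, stdev


def z_prime(positive, negative):
    """Z' = 1 - (3*sd_pos + 3*sd_neg) / |mean_pos - mean_neg|."""
    separation = abs(mean(positive) - mean(negative))
    if separation == 0:
        raise ValueError("control means are identical; Z' is undefined")
    return 1.0 - (3.0 * stdev(positive) + 3.0 * stdev(negative)) / separation


def signal_to_background(positive, negative):
    """Ratio of the mean maximum signal to the mean background signal."""
    return mean(positive) / mean(negative)


# Hypothetical control wells (arbitrary fluorescence units).
max_signal_wells = [4000.0, 4100.0, 3900.0, 4050.0, 3950.0]
background_wells = [1000.0, 1020.0, 980.0, 1010.0, 990.0]

print(f"Z' = {z_prime(max_signal_wells, background_wells):.2f}")
print(f"S/B = {signal_to_background(max_signal_wells, background_wells):.1f}")
```

A Z' above 0.5 indicates that the two control populations are separated by well over three standard deviations on each side, which is the usual threshold for a screening-quality assay.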



C. Fluorescence Polarization Assays 

The effect of various peptide monomers and dimers on the binding of
fluorescein-RP9 (FITC-RP9) to soluble human insulin receptor-immunoglobulin
heavy chain chimera (sIR-Fc; Bass et al., 1996, J. Biol.
Chem. 271:19367-19375) was determined using fluorescence polarization
assays (FP). For these experiments, each data point represents the
average of two replicate wells. The lines represent the best fit to a
four-parameter non-linear regression analysis of the data, which was used to
determine IC50 values.

The assays were performed in a 384-well black microplate (NUNC)
with a final volume of 30 µl. Final incubation conditions were 1 nM
FITC-RP9, 10 nM sIR, 0.05 M Tris-HCl (pH 8 at 25°C), 0.138 M NaCl, 0.0027 M
KCl, 0.05 % BGG (bovine gamma globulin), 0.005 % Tween-20®. After 16-24 hr
of incubation at RT, the fluorescence signal at 520 nm was read on an
Analyst™ AD plate reader (LJL BioSystems, Inc.). Primary data were
background corrected using 10 nM sIR without FITC-RP9 addition,
normalized to buffer controls and then expressed as % Specific Binding.
The Z'-factor was greater than 0.5 (Z' = 1 - (3σ+ + 3σ-)/|μ+ - μ-|; Zhang et
al., 1999, J. Biomol. Screen. 4:67-73) and the assay dynamic range was ~125
mP. In parallel with these experiments, TR-FRET assays were performed
using rhIGF-1R and b-20E2, as described above. Results of the FP and
TR-FRET experiments are shown in Table 22, below.

TABLE 22

Peptide      FP sIR-Fc  TR-FRET rhIGF-1R  Binding Ratio IGF-1R/IR  Formula  Site IGF-1R  SEQ ID NO:  Sequence
RP4          17         8100              476                      2        1            1552        PPWGARFYDAIEQLVFDNL
S175         10         1650              165                      1        1            1560        GRVDWLQRNANFYDWFVAELG
RP15         28         706               25                       1        1            2130        SQAGSAFYAWFDQVLRTV
H2C (D117)   66         600               9                        1                     2117        FHENFYDWFVQRVSKK
20E2 (D118)  51         100               1.9                      2                     2209        DYKDFYDAIDQLVRGSARAGGTRD
RP9          24         33                1.4                      1                     1559        GSLDESFYDWFERQLG
G33          139        178               1.3                      1                     1600        GIISQSCPESFYDWFAGQVSDPWWCW
E8 (D120)    206        175               0.85                     10       2            2118        GGTVWPGYEWLRNA
C1 (D112)    52         10                0.19                     1        1            1550        CWARPCGDAANFYDWFVQQAS
RP16         6400       961               0.15                                           1553        VMDARDDPFYHKLSELVT

FP sIR-Fc column shows IC50 (nM) values obtained (vs. FITC-RP9); TR-FRET rhIGF-1R
column shows IC50 (nM) values obtained (vs. b-20E2); for Binding Ratio: higher values
indicated higher affinity for IR than IGF-1R.

These results demonstrated that S175, RP4, and RP15 showed high
affinities for IR and showed high binding ratios for IGF-1R over IR. H2C,
20E2, RP9, and C1 were slightly less potent than S175, RP4, and RP15 at
IR, and these peptides had lower binding ratios for IGF-1R over IR. G33
and E8 were less potent than S175, RP4, and RP15 at IR, and showed
comparable binding to IGF-1R and IR. RP16 had poor potency at IR and
IGF-1R, but had higher affinity for IGF-1R than IR.
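
The Binding Ratio column of Table 22 appears consistent with dividing the TR-FRET IC50 at rhIGF-1R by the FP IC50 at sIR-Fc. The sketch below recomputes the ratios under that assumption, using the IC50 values (nM) transcribed from Table 22; the function and variable names are hypothetical.

```python
# (FP sIR-Fc IC50 in nM, TR-FRET rhIGF-1R IC50 in nM), transcribed from Table 22.
IC50_NM = {
    "RP4": (17.0, 8100.0),
    "S175": (10.0, 1650.0),
    "RP15": (28.0, 706.0),
    "H2C (D117)": (66.0, 600.0),
    "20E2 (D118)": (51.0, 100.0),
    "RP9": (24.0, 33.0),
    "G33": (139.0, 178.0),
    "E8 (D120)": (206.0, 175.0),
    "C1 (D112)": (52.0, 10.0),
    "RP16": (6400.0, 961.0),
}


def binding_ratio(ic50_ir_nm, ic50_igf1r_nm):
    """IGF-1R/IR ratio; values above 1 indicate higher affinity for IR."""
    return ic50_igf1r_nm / ic50_ir_nm


# List peptides from most IR-selective to most IGF-1R-selective.
ranked = sorted(IC50_NM.items(), key=lambda kv: binding_ratio(*kv[1]), reverse=True)
for peptide, (ir, igf1r) in ranked:
    print(f"{peptide:12s} IR {ir:7.0f} nM  IGF-1R {igf1r:7.0f} nM  "
          f"ratio {binding_ratio(ir, igf1r):8.2f}")
```

Because a lower IC50 means tighter binding, a ratio well above 1 corresponds to selectivity for IR, and a ratio below 1 (as for C1 and RP16) to selectivity for IGF-1R.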

Example 13: Insulin Receptor Surrogates with Enhanced Specificity 

Peptide S597 was tested for its bioactivity relative to insulin. SGBS
cells (a human adipocyte cell line) were incubated with various
concentrations of human insulin or peptide S597 and cellular uptake of
14C-glucose was measured essentially as described in Example 4. The results
(as illustrated in Figure 54) indicate that the potency of S597 in stimulating
glucose uptake is at least as good as that of human insulin.

The glucose-lowering effect of peptide S597 and peptide S557 in rats
was compared with that of insulin as follows: Eighteen male Wistar rats,
200-225 g, fasted for 18 h, were anesthetized using Hypnorm-Dormicum
(1.25 mg/ml Dormicum, 2.5 mg/ml fluanisone, 0.079 mg/ml fentanyl citrate),
given as a 2 ml/kg priming dose 30 min prior to test substance dosing and
an additional 1 ml/kg every 20 minutes (at time points -10 min, 10 min and
30 min relative to test substance dosing).

The rats were allocated into three groups. The animals were dosed
with an intravenous injection (tail vein), 2 ml/kg, of either human insulin 1.25
nmol/kg (n=6) or S557 peptide 5 nmol/kg (n=6) or S597 peptide 5 nmol/kg
(n=6). Blood samples for the determination of whole blood glucose
concentration were collected in heparinized 10 µl glass tubes by puncture of
the capillary vessels in the tail tip at times -20 min and 0 min (before
dosing), and at times 10, 20, 30, 40, 60, 80, 120, and 180 min after dosing.
Blood glucose concentrations were measured after dilution in analysis buffer
by the immobilized glucose oxidase method using an EBIO Plus
autoanalyzer (Eppendorf, Germany).
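
The dosing arithmetic implied by the protocol above (dose in nmol/kg delivered in 2 ml/kg) can be checked with a short calculation. This is only a worked illustration; the helper names are hypothetical, and the 200 g and 225 g body weights are simply the ends of the stated weight range.

```python
def dosing_solution_nmol_per_ml(dose_nmol_per_kg, volume_ml_per_kg):
    """Concentration the dosing solution must have to deliver the dose."""
    return dose_nmol_per_kg / volume_ml_per_kg


def per_animal(dose_nmol_per_kg, volume_ml_per_kg, body_weight_g):
    """Return (nmol injected, ml injected) for one animal."""
    body_weight_kg = body_weight_g / 1000.0
    return dose_nmol_per_kg * body_weight_kg, volume_ml_per_kg * body_weight_kg


INJECTION_VOLUME_ML_PER_KG = 2.0
DOSES_NMOL_PER_KG = {"human insulin": 1.25, "S557": 5.0, "S597": 5.0}

for substance, dose in DOSES_NMOL_PER_KG.items():
    conc = dosing_solution_nmol_per_ml(dose, INJECTION_VOLUME_ML_PER_KG)
    for weight_g in (200.0, 225.0):
        nmol, ml = per_animal(dose, INJECTION_VOLUME_ML_PER_KG, weight_g)
        print(f"{substance:14s} {conc:6.3f} nmol/ml; "
              f"{weight_g:.0f} g rat: {nmol:.3f} nmol in {ml:.2f} ml")
```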

The results (as illustrated in Figure 55) indicate that the blood glucose
lowering effect of S597 in rats is about 4 times lower than that of human
insulin. The improved effect of S597 relative to S557 shows the effect of
N-terminal acetylation.

The glucose-lowering effect of different concentrations of peptide
S597 was also tested by intravenous administration to fasted Goettingen
minipigs weighing about 15 kg. The results (as illustrated in Figure 56)
indicate that the glucose-lowering effect at 3 nmol/kg S597 is comparable to
that of 0.3 nmol/kg human insulin.

Example 14: Co-administration of Therapeutic Peptides

The rate of disappearance of two co-administered peptides was 
tested as follows: 

Mixtures containing 600 nmol/ml peptide S557 and 1800 nmol/ml
B29-Nε-(N-lithocholyl-γ-glutamyl)-des(B30) human insulin, including
125I-labeled peptides, were injected into the neck of a pig. Radioactivity at
the injection site was monitored over time using an external gamma counter.

The results (as illustrated in Figure 57) indicate that the
disappearance of either peptide was not influenced by the presence of the
second peptide.

Incorporated herein by reference in its entirety is the Sequence
Listing for the application, comprising SEQ ID NO:1 to SEQ ID NO:2227.
The Sequence Listing is disclosed on three CD-ROMs, designated "CRF",
"Copy 1", and "Copy 2". The Sequence Listing is a computer-readable
ASCII file named "18784057PC.app.txt", created on September 23, 2002, in
IBM-PC machine format, on a MS-Windows®98 operating system. The
18784057PC.app.txt file is 927,476 bytes in size.

As various changes can be made in the above compositions and
methods without departing from the scope and spirit of the invention, it is
intended that all subject matter contained in the above description, shown in
the accompanying drawings, or defined in the appended claims be
interpreted as illustrative, and not in a limiting sense.

The contents of all patents, patent applications, published articles, 
books, reference manuals, texts and abstracts cited herein are hereby 
incorporated by reference in their entirety to more fully describe the state of 
the art to which the present invention pertains. 

WHAT IS CLAIMED IS:

1. A method of decreasing insulin receptor activity in mammalian cells 
comprising administering to said cells an amino acid sequence in an 
amount sufficient to decrease insulin activity, wherein the amino acid 
sequence comprises a subsequence that comprises a sequence that 
binds to Site 1 of insulin receptor and a subsequence that comprises 
a sequence that binds to Site 2 of insulin receptor, and wherein the 
subsequences are linked C-terminus to N-terminus and the 
subsequences are oriented Site 1 to Site 2, with the proviso that the 
amino acid sequence is not insulin, insulin-like growth factor, or 
fragments thereof. 

2. The method according to claim 1, wherein the Site 1 sequence
consists essentially of a Formula 1 sequence X1X2X3X4X5 and the Site
2 sequence consists essentially of a Formula 6 sequence X62 X63 X64
X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 X80 X81, and
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any
polar amino acid, and wherein X62, X65, X66, X68, X69, X71, X73, X76, X77,
X78, X80 and X81 are any amino acid; X63, X70, and X74 are
hydrophobic amino acids; X64 is a polar amino acid; X67 and X75 are
aromatic amino acids; and X72 and X79 are cysteines.

3. The method according to claim 2, wherein X1, X2, and X5 are selected
from the group consisting of phenylalanine and tyrosine, X3 is
selected from the group consisting of aspartic acid, glutamic acid,
glycine and serine, and X4 is selected from the group consisting of
tryptophan, tyrosine and phenylalanine.

4. The method according to claim 2, wherein X63 is selected from the
group consisting of leucine, isoleucine, methionine and valine; X70
and X74 are selected from the group consisting of valine, isoleucine,
leucine and methionine; X64 is selected from the group consisting of
aspartic acid and glutamic acid; X67 is tryptophan; and X75 is selected
from the group consisting of tyrosine and tryptophan.

5. The method according to claim 2, wherein the Formula 1 sequence
X1X2X3X4X5 is FYDWF (SEQ ID NO:1554).

6. The method according to claim 2, wherein the Formula 6 sequence
X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79
X80 X81 is WLDQEWAWVQCEVYGRGCPS (SEQ ID NO:2129).

7. The method according to claim 2, wherein the Formula 1 sequence 
is selected from the group consisting of sequences SEQ ID NOS:1- 
712 (Figures 1A-10); SEQ ID NOS:1221-1243 (Figure 8); and SEQ 
ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2). 

8. The method according to claim 2, wherein the Formula 6 sequence 
is selected from the group consisting of sequences SEQ ID NOS:926- 
1061 (Figures 3A-3E); SEQ ID NOS:1244-1253 (Figure 9A); and 
SEQ ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 
2).

9. The method according to claim 2, wherein the Formula 1 sequence
is selected from the group consisting of sequences S105-S116 (SEQ 
ID NOS:1791-1805, 1556, and 1806-1807), S131 (SEQ ID NO:1820), 
S137 (SEQ ID NO:1821), S158 (SEQ ID NO:1780), S165-S168 (SEQ 
ID NOS:1554, and 1824-1826), S171 (SEQ ID NO:1831), S175-S176 
(SEQ ID NOS:1560 and 1836), S179-S184 (SEQ ID NOS:1839- 
1844), S214-216 (SEQ ID NOS:1845-1847), S219-223 (SEQ ID
NOS:1852-1856), S227 (SEQ ID NO:1858), S234-245 (SEQ ID
NOS:1869-1880), S248-S251 (SEQ ID NOS:1883-1886), S264-S265
(SEQ ID NOS:1900-1901), S268 (SEQ ID NO:1903), S278 (SEQ ID
NO:1905), S287 (SEQ ID NO:1911), S294-S295 (SEQ ID NOS:1922-
1923), S315 (SEQ ID NO:1937), S319-322 (SEQ ID NOS:1940-
1943), S326 (SEQ ID NO:1600), S342 (SEQ ID NO:1962), S365-
S366 (SEQ ID NOS:1987-1988), S371-S373 (SEQ ID NOS:1558,
and 1990-1991), S386-S403 (SEQ ID NOS:1559, 2005-2007, 1794, 
2008-2009, 1788, 1787, 1789, 2010-2011, 1791, and 2012-2014), 
RB437 (SEQ ID NO:2164), RB502 (SEQ ID NO:2170), RB452 (SEQ 
ID NO:2173), RB513 (SEQ ID NO:2176), RB464 (SEQ ID NO:2179), 
RB596 (SEQ ID NO:2202), RB569 (SEQ ID NO:2203), and RB570 
(SEQ ID NO:2204). 

10. The method according to claim 2, wherein the Formula 6 sequence 
is selected from the group consisting of sequences S256 (SEQ ID 
NO:1893), S263 (SEQ ID NO:1899), S266 (SEQ ID NO:1902), S284- 
285 (SEQ ID NOS:1909-1910), S515 (SEQ ID NO:2102), and RB426 
(SEQ ID NO:2158).

11. The method according to claim 2, wherein the Formula 1 sequence
is selected from the group consisting of sequences

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556);
FHENFYDWFVRQVS (SEQ ID NO:1557);
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558);
GSLDESFYDWFERQLG (SEQ ID NO:1559);
GLADEDFYEWFERQLR (SEQ ID NO:1561);
GLADELFYEWFDRQLS (SEQ ID NO:1562);
GQLDEDFYEWFDRQLS (SEQ ID NO:1563);
GQLDEDFYAWFDRQLS (SEQ ID NO:1564);
GFMDESFYEWFERQLR (SEQ ID NO:1565);
GFWDESFYAWFERQLR (SEQ ID NO:1566);
GFMDESFYAWFERQLR (SEQ ID NO:1567);
GFWDESFYEWFERQLR (SEQ ID NO:1568);
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2130); and
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560).

12. The method according to claim 2, wherein the Formula 6 sequence 
is a D8 sequence selected from the group consisting of: 

WLDQEWVQCEVYGRGCPSKK (SEQ ID NO:2227);
KWLDQEWAWVQCEVYGRGCPSKK (SEQ ID NO:1579);
KWLDQEWAWVQCEVYGRGCPS (SEQ ID NO:1580);
SLEEEWAQIQCEIYGRGCRY (SEQ ID NO:1581);
SLEEEWAQIQCEIWGRGCRY (SEQ ID NO:1582);
SLEEEWAQIECEVYGRGCPS (SEQ ID NO:1583); and
SLEEEWAQIECEVWGRGCPS (SEQ ID NO:1584).

13. The method according to claim 2, wherein the amino acid sequence 
is selected from the group consisting of sequences 537-538 (SEQ ID 
NOS:2114-2115).

14. The method according to claim 2, wherein the amino acid sequence
is selected from the group consisting of sequences S425 (SEQ ID
NO:2031), S454 (SEQ ID NOS:2058-2059), S459 (SEQ ID NO:2065),
and RB537-RB538 (SEQ ID NOS:2197-2198).

15. The method according to claim 1, wherein the Site 1 sequence
consists essentially of a Formula 1 sequence X1X2X3X4X5 and the Site
2 sequence consists essentially of a Formula 4 sequence
X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36X37X38X39X40X41, and
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any
polar amino acid, and wherein X22, X25, X26, X28, X29, X30, X33, X34,
X35, X37, X38, X40 and X41 are any amino acid; X23 is any hydrophobic
amino acid; X27 is a polar amino acid; X31 is an aromatic amino acid;
X32 is a small amino acid; and wherein at least one cysteine is located
at positions X24 through X27 and one at X39 or X40.

16. The method according to claim 15, wherein X1, X2, and X5 are
selected from the group consisting of phenylalanine and tyrosine, X3
is selected from the group consisting of aspartic acid, glutamic acid,
glycine and serine, and X4 is selected from the group consisting of
tryptophan, tyrosine and phenylalanine.

17. The method according to claim 15, wherein X24 and X39 are cysteines,
X23 is selected from leucine, isoleucine, methionine and valine; X27 is
selected from glutamic acid, aspartic acid, asparagine, and
glutamine; X31 is tryptophan, X32 is glycine; and X36 is any aromatic
amino acid.

18. The method according to claim 15, wherein the Formula 1 sequence
X1X2X3X4X5 is FYDWF (SEQ ID NO:1554).

19. The method according to claim 15, wherein the Formula 4 sequence
X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36X37X38X39X40X41 is
HLCVLEELFWGASLFGYCSG (SEQ ID NO:1576).

20. The method according to claim 15, wherein the Formula 1 sequence
is selected from the group consisting of sequences SEQ ID NOS:1-
712 (Figures 1A-10); SEQ ID NOS:1221-1243 (Figure 8); and SEQ
ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2).

21. The method according to claim 15, wherein the Formula 4 sequence
is selected from the group consisting of sequences SEQ ID NOS:713-
925 (Figures 2A-2E); SEQ ID NOS:1254-1261 (Figure 9B); and SEQ
ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2).

22. The method according to claim 15, wherein the Formula 1 sequence
is selected from the group consisting of sequences S105-S116 (SEQ
ID NOS:1791-1805, 1556, and 1806-1807), S131 (SEQ ID NO:1820),
S137 (SEQ ID NO:1821), S158 (SEQ ID NO:1780), S165-S168 (SEQ
ID NOS:1554, and 1824-1826), S171 (SEQ ID NO:1831), S175-S176
(SEQ ID NOS:1560 and 1836), S179-S184 (SEQ ID NOS:1839-
1844), S214-216 (SEQ ID NOS:1845-1847), S219-223 (SEQ ID
NOS:1852-1856), S227 (SEQ ID NO:1858), S234-245 (SEQ ID
NOS:1869-1880), S248-S251 (SEQ ID NOS:1883-1886), S264-S265
(SEQ ID NOS:1900-1901), S268 (SEQ ID NO:1903), S278 (SEQ ID
NO:1905), S287 (SEQ ID NO:1911), S294-S295 (SEQ ID NOS:1922-
1923), S315 (SEQ ID NO:1937), S319-322 (SEQ ID NOS:1940-
1943), S326 (SEQ ID NO:1600), S342 (SEQ ID NO:1962), S365-
S366 (SEQ ID NOS:1987-1988), S371-S373 (SEQ ID NOS:1558,
and 1990-1991), S386-S403 (SEQ ID NOS:1559, 2005-2007, 1794,
2008-2009, 1788, 1787, 1789, 2010-2011, 1791, and 2012-2014),
RB437 (SEQ ID NO:2164), RB502 (SEQ ID NO:2170), RB452
(SEQ ID NO:2173), RB513 (SEQ ID NO:2176), RB464 (SEQ ID
NO:2179), RB596 (SEQ ID NO:2202), RB569 (SEQ ID NO:2203),
and RB570 (SEQ ID NO:2204).

23. The method according to claim 15, wherein the Formula 4 sequence
is selected from the group consisting of sequences S262 (SEQ ID
NO:1898), S282-S283 (SEQ ID NOS:1907-1908), S331 (SEQ ID
NO:1949), RB505M (SEQ ID NO:2160), RB446 (SEQ ID NO:2180),
and RB505 (SEQ ID NO:2191).

24. The method according to claim 15, wherein the Formula 4 sequence 
is selected from the group consisting of sequences RB517M (SEQ ID 
NO:2161), RB515 (SEQ ID NO:2162), and RB510 (SEQ ID 
NO:2163). 

25. The method according to claim 15, wherein the Formula 1 sequence
is selected from the group consisting of sequences

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556);
FHENFYDWFVRQVS (SEQ ID NO:1557);
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558);
GSLDESFYDWFERQLG (SEQ ID NO:1559);
GLADEDFYEWFERQLR (SEQ ID NO:1561);
GLADELFYEWFDRQLS (SEQ ID NO:1562);
GQLDEDFYEWFDRQLS (SEQ ID NO:1563);
GQLDEDFYAWFDRQLS (SEQ ID NO:1564);
GFMDESFYEWFERQLR (SEQ ID NO:1565);
GFWDESFYAWFERQLR (SEQ ID NO:1566);
GFMDESFYAWFERQLR (SEQ ID NO:1567);
GFWDESFYEWFERQLR (SEQ ID NO:1568);
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2130); and
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560).

26. The method according to claim 15, wherein the amino acid sequence
is selected from the group consisting of sequences 431-433 (SEQ ID
NOS:2135-2137).

27. The method according to claim 15, wherein the amino acid sequence
is selected from the group consisting of sequences RB431-RB433
(SEQ ID NOS:2184, 2186 and 2188).



28. A method of increasing insulin receptor activity in mammalian cells 
comprising administering to said cells an amino acid sequence in an 
amount sufficient to increase insulin activity, wherein the amino acid 
sequence comprises a subsequence that comprises a sequence that 
binds to Site 1 of insulin receptor and a subsequence that comprises 
a sequence that binds to Site 2 of insulin receptor, and wherein the 
subsequences are linked C-terminus to N-terminus and the 
subsequences are oriented Site 2 to Site 1, with the proviso that the 
amino acid sequence is not insulin, insulin-like growth factor, or 
fragments thereof. 

29. The method according to claim 28, wherein the Site 1 sequence
consists essentially of a Formula 1 sequence X1X2X3X4X5 and the Site
2 sequence consists essentially of a Formula 6 sequence X62 X63 X64
X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 X80 X81, and
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any
polar amino acid, and wherein X62, X65, X66, X68, X69, X71, X73, X76, X77,
X78, X80 and X81 are any amino acid; X63, X70, and X74 are
hydrophobic amino acids; X64 is a polar amino acid; X67 and X75 are
aromatic amino acids; and X72 and X79 are cysteines.

30. The method according to claim 29, wherein X1, X2, and X5 are
selected from the group consisting of phenylalanine and tyrosine, X3
is selected from the group consisting of aspartic acid, glutamic acid,
glycine and serine, and X4 is selected from the group consisting of
tryptophan, tyrosine and phenylalanine.

31. The method according to claim 29, wherein X63 is selected from the
group consisting of leucine, isoleucine, methionine and valine; X70
and X74 are selected from the group consisting of valine, isoleucine,
leucine and methionine; X64 is selected from the group consisting of
aspartic acid and glutamic acid; X67 is tryptophan; and X75 is selected
from the group consisting of tyrosine and tryptophan.

32. The method according to claim 29, wherein the Formula 1 sequence
X1X2X3X4X5 is FYDWF (SEQ ID NO:1554).

33. The method according to claim 29, wherein the Formula 6 sequence
X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79
X80 X81 is WLDQEWAWVQCEVYGRGCPS (SEQ ID NO:2129).

34. The method according to claim 29, wherein the Formula 1 sequence 
is selected from the group consisting of sequences SEQ ID NOS:1- 
712 (Figures 1A-10); SEQ ID NOS:1221-1243 (Figure 8); and SEQ 
ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2). 

35. The method according to claim 29, wherein the Formula 6 sequence 
is selected from the group consisting of sequences SEQ ID NOS:926- 
1061 (Figures 3A-3E); SEQ ID NOS:1244-1253 (Figure 9A); and 
SEQ ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 
2).

36. The method according to claim 29, wherein the Formula 1 sequence
is selected from the group consisting of sequences S105-S116 (SEQ
ID NOS:1791-1805, 1556, and 1806-1807), S131 (SEQ ID NO:1820),
S137 (SEQ ID NO:1821), S158 (SEQ ID NO:1780), S165-S168 (SEQ
ID NOS:1554, and 1824-1826), S171 (SEQ ID NO:1831), S175-S176
(SEQ ID NOS:1560 and 1836), S179-S184 (SEQ ID NOS:1839-
1844), S214-216 (SEQ ID NOS:1845-1847), S219-223 (SEQ ID
NOS:1852-1856), S227 (SEQ ID NO:1858), S234-245 (SEQ ID
NOS:1869-1880), S248-S251 (SEQ ID NOS:1883-1886), S264-S265
(SEQ ID NOS:1990-1991), S268 (SEQ ID NO:1903), S278 (SEQ ID
NO:1905), S287 (SEQ ID NO:1911), S294-S295 (SEQ ID NOS:1922-
1923), S315 (SEQ ID NO:1937), S319-322 (SEQ ID NOS:1940-
1943), S326 (SEQ ID NO:1600), S342 (SEQ ID NO:1962), S365-
S366 (SEQ ID NOS:1987-1988), S371-S373 (SEQ ID NOS:1558,
1900-1901), S386-S403 (SEQ ID NOS:1559, 2005-2007, 1794, 2008- 
2009, 1788, 1787, 1789, 2010-2011, 1791, and 2012-2014), RB437 
(SEQ ID NO:2164), RB502 (SEQ ID NO:2170), RB452 (SEQ ID 
NO:2173), RB513 (SEQ ID NO:2176), RB464 (SEQ ID NO:2179), 
RB596 (SEQ ID NO:2202), RB569 (SEQ ID NO:2203), and RB570 
(SEQ ID NO:2204). 

37. The method according to claim 29, wherein the Formula 6 sequence 
is selected from the group consisting of sequences S256 (SEQ ID 
NO:1893), S263 (SEQ ID NO:1899), S266 (SEQ ID NO:1902), S284- 
285 (SEQ ID NOS:1909-1910), S515 (SEQ ID NO:2102), and RB426 
(SEQ ID NO:2158).

38. The method according to claim 29, wherein the Formula 1 sequence
is selected from the group consisting of sequences

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556);
FHENFYDWFVRQVS (SEQ ID NO:1557);
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558);
GSLDESFYDWFERQLG (SEQ ID NO:1559);
GLADEDFYEWFERQLR (SEQ ID NO:1561);
GLADELFYEWFDRQLS (SEQ ID NO:1562);
GQLDEDFYEWFDRQLS (SEQ ID NO:1563);
GQLDEDFYAWFDRQLS (SEQ ID NO:1564);
GFMDESFYEWFERQLR (SEQ ID NO:1565);
GFWDESFYAWFERQLR (SEQ ID NO:1566);
GFMDESFYAWFERQLR (SEQ ID NO:1567);
GFWDESFYEWFERQLR (SEQ ID NO:1568);
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2130); and
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560).

39. The method according to claim 29, wherein the Formula 6 sequence 
is a D8 sequence selected from the group consisting of: 

WLDQEWVQCEVYGRGCPSKK (SEQ ID NO:2227);
KWLDQEWAWVQCEVYGRGCPSKK (SEQ ID NO:1579);
KWLDQEWAWVQCEVYGRGCPS (SEQ ID NO:1580);
SLEEEWAQIQCEIYGRGCRY (SEQ ID NO:1581);
SLEEEWAQIQCEIWGRGCRY (SEQ ID NO:1582);
SLEEEWAQIECEVYGRGCPS (SEQ ID NO:1583); and
SLEEEWAQIECEVWGRGCPS (SEQ ID NO:1584).

40. The method according to claim 29, wherein the amino acid sequence
is sequence 539 (SEQ ID NO:2116).

41. The method according to claim 29, wherein the amino acid sequence
is selected from the group consisting of sequences RP27 (SEQ ID 
NO:2213), RP28 (SEQ ID NO:2214), RP29 (SEQ ID NO:2215), RP30 
(SEQ ID NO:2216), RP31 (SEQ ID NO:2217), RP32 (SEQ ID 
NO:2218), RP33 (SEQ ID NO:2219), RP34 (SEQ ID NO:2220), RP35 
(SEQ ID NO:2221), and RP36 (SEQ ID NO:2222). 

42. The method according to claim 29, wherein the amino acid sequence 
is selected from the group consisting of sequences D8-6aa-S175 
(SEQ ID NO:2121), D8-12aa-S175 (SEQ ID NO:2122), D8-6aa-RP6 
(SEQ ID NO:2126), and D8-6aa-RP17 (SEQ ID NO:2127). 

43. The method according to claim 29, wherein the amino acid sequence 
is selected from the group consisting of sequences S429 (SEQ ID 
NO:2032), S455 (SEQ ID NO:2060), S457-S458 (SEQ ID NOS:2063- 
2064), S467-S468 (SEQ ID NOS:2066-2067), S471 (SEQ ID 
NO:2068), S481-S513 (SEQ ID NOS:2069-2101), S517-S520 (SEQ 
ID NOS:2104-2107), S524 (SEQ ID NO:2111), RB539 (SEQ ID 
NO:2196), RB625-RB626 (SEQ ID NOS:2200 and 2199), and RB622 
(SEQ ID NO:2201).

44. The method according to claim 28, wherein the Site 1 sequence
consists essentially of a Formula 1 sequence X1X2X3X4X5 and the Site
2 sequence consists essentially of a Formula 4 sequence
X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36X37X38X39X40X41, and
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any
polar amino acid, and wherein X22, X25, X26, X28, X29, X30, X33, X34,
X35, X37, X38, X40 and X41 are any amino acid; X23 is any hydrophobic
amino acid; X27 is a polar amino acid; X31 is an aromatic amino acid;
X32 is a small amino acid; and wherein at least one cysteine is located
at positions X24 through X27 and one at X39 or X40.

45. The method according to claim 44, wherein X1, X2, and X5 are
selected from the group consisting of phenylalanine and tyrosine, X3
is selected from the group consisting of aspartic acid, glutamic acid,
glycine and serine, and X4 is selected from the group consisting of
tryptophan, tyrosine and phenylalanine.

46. The method according to claim 45, wherein X24 and X39 are cysteines,
X23 is selected from leucine, isoleucine, methionine and valine; X27 is
selected from glutamic acid, aspartic acid, asparagine, and
glutamine; X31 is tryptophan, X32 is glycine; and X36 is any aromatic
amino acid.

47. The method according to claim 45, wherein the Formula 1 sequence
X1X2X3X4X5 is FYDWF (SEQ ID NO:1554).

48. The method according to claim 45, wherein the Formula 4 sequence
X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36X37X38X39X40X41 is
HLCVLEELFWGASLFGYCSG (SEQ ID NO:1576).

49. The method according to claim 45, wherein the Formula 1 sequence 
is selected from the group consisting of sequences SEQ ID NOS:1- 
712 (Figures 1A-10); SEQ ID NOS:1221-1243 (Figure 8); and SEQ 
ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2). 

50. The method according to claim 45, wherein the Formula 4 sequence 
is selected from the group consisting of sequences SEQ ID NOS:713- 
925 (Figures 2A-2E); SEQ ID NOS:1254-1261 (Figure 9B); and SEQ 
ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2).

51. The method according to claim 45, wherein the Formula 1 sequence
is selected from the group consisting of sequences S105-S116 (SEQ
ID NOS:1791-1805, 1556, and 1806-1807), S131 (SEQ ID NO:1820),
S137 (SEQ ID NO:1821), S158 (SEQ ID NO:1780), S165-S168 (SEQ
ID NOS:1554, and 1824-1826), S171 (SEQ ID NO:1831), S175-S176
(SEQ ID NOS:1560 and 1836), S179-S184 (SEQ ID NOS:1839-
1844), S214-216 (SEQ ID NOS:1845-1847), S219-223 (SEQ ID
NOS:1852-1856), S227 (SEQ ID NO:1858), S234-245 (SEQ ID
NOS:1869-1880), S248-S251 (SEQ ID NOS:1883-1886), S264-S265
(SEQ ID NOS:1900-1901), S268 (SEQ ID NO:1903), S278 (SEQ ID
NO:1905), S287 (SEQ ID NO:1911), S294-S295 (SEQ ID NOS:1922-
1923), S315 (SEQ ID NO:1937), S319-322 (SEQ ID NOS:1940-
1943), S326 (SEQ ID NO:1600), S342 (SEQ ID NO:1962), S365-
S366 (SEQ ID NOS:1987-1988), S371-S373 (SEQ ID NOS:1558,
and 1990-1991), S386-S403 (SEQ ID NOS:1559, 2005-2007, 1794,
2008-2009, 1788, 1787, 1789, 2010-2011, 1791, and 2012-2014),
RB437 (SEQ ID NO:2164), RB502 (SEQ ID NO:2170), RB452 (SEQ
ID NO:2173), RB513 (SEQ ID NO:2176), RB464 (SEQ ID NO:2179),
RB596 (SEQ ID NO:2202), RB569 (SEQ ID NO:2203), and RB570
(SEQ ID NO:2204).

52. The method according to claim 45, wherein the Formula 4 sequence 
is selected from the group consisting of sequences S262 (SEQ ID 
NO:1898), S282-S283 (SEQ ID NOS:1907-1908), S331 (SEQ ID
NO:1949), RB505M (SEQ ID NO:2160), RB446 (SEQ ID NO:2180), 
and RB505 (SEQ ID NO:2191). 

53. The method according to claim 45, wherein the Formula 4 sequence 
is selected from the group consisting of sequences RB517M (SEQ ID 
NO:2161), RB515 (SEQ ID NO:2162), and RB510 (SEQ ID 
NO:2163).

54. The method according to claim 45, wherein the Formula 1 sequence
is selected from the group consisting of sequences

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556);
FHENFYDWFVRQVS (SEQ ID NO:1557);
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558);
GSLDESFYDWFERQLG (SEQ ID NO:1559);
GLADEDFYEWFERQLR (SEQ ID NO:1561);
GLADELFYEWFDRQLS (SEQ ID NO:1562);
GQLDEDFYEWFDRQLS (SEQ ID NO:1563);
GQLDEDFYAWFDRQLS (SEQ ID NO:1564);
GFMDESFYEWFERQLR (SEQ ID NO:1565);
GFWDESFYAWFERQLR (SEQ ID NO:1566);
GFMDESFYAWFERQLR (SEQ ID NO:1567);
GFWDESFYEWFERQLR (SEQ ID NO:1568);
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2130); and
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560).

55. The method according to claim 45, wherein the amino acid sequence 
is selected from the group consisting of sequences F8-6aa-RP9 
(SEQ ID NO:2119), F8-12aa-RP9 (SEQ ID NO:2120), F8-6aa-S175 
(SEQ ID NO:2123), F8-12aa-S175 (SEQ ID NO:2124), S516 (SEQ ID 
NO:2103), S521 (SEQ ID NO:2108), and S522 (SEQ ID NO:2109). 

56. A method of increasing insulin receptor activity in mammalian cells 
comprising administering to said cells an amino acid sequence in an 
amount sufficient to increase insulin activity, wherein the amino acid 
sequence comprises a plurality of subsequences that each comprise 
a sequence that binds to Site 1 of insulin receptor, with the proviso 
that the amino acid sequence is not insulin, insulin-like growth factor, 
or fragments thereof. 

57. The method according to claim 56, wherein the Site 1 sequence 
consists essentially of a Formula 1 sequence X1X2X3X4X5, and 
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any 
polar amino acid. 

58. The method according to claim 57, wherein X1, X2, and X5 are 
selected from the group consisting of phenylalanine and tyrosine, X3 
is selected from the group consisting of aspartic acid, glutamic acid, 
glycine and serine, and X4 is selected from group consisting of 
tryptophan, tyrosine and phenylalanine. 

59. The method according to claim 57, wherein the Formula 1 sequence 
X1X2X3X4X5 is FYDWF (SEQ ID NO:1554). 

60. The method according to claim 57, wherein the Formula 1 sequence 
is selected from the group consisting of sequences SEQ ID NOS:1- 
712 (Figures 1A-10); SEQ ID NOS:1221-1243 (Figure 8); and SEQ 
ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2). 

61. The method according to claim 57, wherein the Formula 1 sequence 
is selected from the group consisting of sequences S105-S116 (SEQ 
ID NOS:1791-1805, 1556, and 1806-1807), S131 (SEQ ID NO:1820), 
S137 (SEQ ID NO:1821), S158 (SEQ ID NO:1780), S165-S168 (SEQ 
ID NOS:1554, and 1824-1826), S171 (SEQ ID NO:1831), S175-S176 
(SEQ ID NOS:1560 and 1836), S179-S184 (SEQ ID NOS:1839- 
1844), S214-216 (SEQ ID NOS:1845-1847), S219-223 (SEQ ID 
NOS:1852-1856), S227 (SEQ ID NO:1858), S234-245 (SEQ ID 
NOS:1869-1880), S248-S251 (SEQ ID NOS:1883-1886), S264-S265 
(SEQ ID NOS:1900-1901), S268 (SEQ ID NO:1903), S278 (SEQ ID 
NO:1905), S287 (SEQ ID NO:1911), S294-S295 (SEQ ID NOS:1922- 
1923), S315 (SEQ ID NO:1937), S319-322 (SEQ ID NOS:1940- 
1943), S326 (SEQ ID NO:1600), S342 (SEQ ID NO:1962), S365-S366 
(SEQ ID NOS:1987-1988), S371-S373 (SEQ ID NOS:1558, and 
1990-1991), S386-S403 (SEQ ID NOS:1559, 2005-2007, 1794, 2008- 
2009, 1788, 1787, 1789, 2010-2011, 1791, and 2012-2014), RB437 
(SEQ ID NO:2164), RB502 (SEQ ID NO:2170), RB452 (SEQ ID 
NO:2173), RB513 (SEQ ID NO:2176), RB464 (SEQ ID NO:2179), 
RB596 (SEQ ID NO:2202), RB569 (SEQ ID NO:2203), and RB570 
(SEQ ID NO:2204). 

62. The method according to claim 57, wherein the Formula 1 sequence 
is selected from the group consisting of sequences 

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556); 
FHENFYDWFVRQVS (SEQ ID NO:1557); 
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558); 
GSLDESFYDWFERQLG (SEQ ID NO:1559); 
GLADEDFYEWFERQLR (SEQ ID NO:1561); 
GLADELFYEWFDRQLS (SEQ ID NO:1562); 
GQLDEDFYEWFDRQLS (SEQ ID NO:1563); 
GQLDEDFYAWFDRQLS (SEQ ID NO:1564); 
GFMDESFYEWFERQLR (SEQ ID NO:1565); 
GFWDESFYAWFERQLR (SEQ ID NO:1566); 
GFMDESFYAWFERQLR (SEQ ID NO:1567); 
GFWDESFYEWFERQLR (SEQ ID NO:1568); 
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2130); and 
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). 

63. The method according to claim 57, wherein the amino acid sequence 
is selected from the group consisting of sequences 521 (SEQ ID 
NO:2112) and 535 (SEQ ID NO:2113). 

64. The method according to claim 57, wherein the amino acid sequence 
is selected from the group consisting of sequences 434 (SEQ ID 
NO:2138), 436 (SEQ ID NO:2139), 427 (SEQ ID NO:2141), 435 
(SEQ ID NO:2142), 439 (SEQ ID NO:2143), 449 (SEQ ID NO:2144), 
and 463 (SEQ ID NO:2146). 

65. The method according to claim 57, wherein the amino acid sequence 
is selected from the group consisting of sequences S128 (SEQ ID 
NOS:1817-1818), S145 (SEQ ID NOS:1822-1823), S169-S170 (SEQ 
ID NOS:1827-1830), S172 (SEQ ID NOS:1832-1833), S218 (SEQ ID 
NOS:1850-1851), S228 (SEQ ID NOS:1859-1860), S231-232 (SEQ 
ID NOS:1863-1866), S253 (SEQ ID NOS:1889-1890), S267 (SEQ ID 
NO:), S290-S293 (SEQ ID NOS:1914-1921), S300-S301 (SEQ ID 
NOS:1924-1927), S312 (SEQ ID NOS:1933-1934), S325 (SEQ ID 
NOS:1944-1945), S329 (SEQ ID NOS:1947-1948), S332-S337 (SEQ 
ID NOS:1950-1961), S349-S354 (SEQ ID NOS:1965-1976), S359- 
S363 (SEQ ID NOS:1977-1986), S374-S376 (SEQ ID NOS:1992- 
1996), S378-S381 (SEQ ID NOS:1997-2004), S414-S418 (SEQ ID 
NOS:2015-2024), S420 (SEQ ID NOS:2027-2028), RB463 (SEQ ID 
NO:2165), RB439 (SEQ ID NO:2166), RB436 (SEQ ID NO:2167), 
RB449 (SEQ ID NO:2168), RB508M-RB509M (SEQ ID NOS:2171- 
2172), RB508-RB509 (SEQ ID NOS:2189-2190), RB521 (SEQ ID 
NO:2193), and RB535 (SEQ ID NO:2194). 

66. A method of increasing insulin receptor activity in mammalian cells 
comprising administering to said cells an amino acid sequence in an 
amount sufficient to increase insulin activity, wherein the amino acid 
sequence comprises a subsequence that comprises a sequence that 
binds to Site 1 of insulin receptor and a subsequence that comprises 
a sequence that binds to Site 2 of insulin receptor, wherein the 
subsequences are linked C-terminus to C-terminus or N-terminus to 
N-terminus, with the proviso that the amino acid sequence is not 
insulin, insulin-like growth factor, or fragments thereof. 

67. The method according to claim 66, wherein the Site 1 sequence 
consists essentially of a Formula 1 sequence X1X2X3X4X5 and the Site 
2 sequence consists essentially of a Formula 6 sequence X62 X63 X64 
X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 X80 X81, and 
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any 
polar amino acid, and wherein X62, X65, X66, X68, X69, X71, X73, X76, X77, 
X78, X80 and X81 are any amino acid; X63, X70, and X74 are 
hydrophobic amino acids; X64 is a polar amino acid; X67 and X75 are 
aromatic amino acids; and X72 and X79 are cysteines. 

68. The method according to claim 67, wherein X1, X2, and X5 are 
selected from the group consisting of phenylalanine and tyrosine, X3 
is selected from the group consisting of aspartic acid, glutamic acid, 
glycine and serine, and X4 is selected from group consisting of 
tryptophan, tyrosine and phenylalanine. 

69. The method according to claim 67, wherein X63 is selected from the 
group consisting of leucine, isoleucine, methionine and valine; X70 
and X74 are selected from group consisting of valine, isoleucine, 
leucine and methionine; X64 is selected from group consisting of 
aspartic acid and glutamic acid; X67 is tryptophan; and X75 is selected 
from group consisting of tyrosine and tryptophan. 

70. The method according to claim 67, wherein the Formula 1 sequence 
X1X2X3X4X5 is FYDWF (SEQ ID NO:1554). 

71. The method according to claim 67, wherein the Formula 6 sequence 
X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 
X80 X81 is WLDQEWAWVQCEVYGRGCPS (SEQ ID NO:2056). 

72. The method according to claim 67, wherein the Formula 1 sequence 
is selected from the group consisting of sequences SEQ ID NOS:1- 
712 (Figures 1A-10); SEQ ID NOS:1221-1243 (Figure 8); and SEQ 
ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2). 

73. The method according to claim 67, wherein the Formula 6 sequence 
is selected from the group consisting of sequences SEQ ID NOS:926- 
1061 (Figures 3A-3E); SEQ ID NOS:1244-1253 (Figure 9A); and 
SEQ ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 
2). 

74. The method according to claim 67, wherein the Formula 1 sequence 
is selected from the group consisting of sequences S105-S116 (SEQ 
ID NOS:1791-1805, 1556, and 1806-1807), S131 (SEQ ID NO:1820), 
S137 (SEQ ID NO:1821), S158 (SEQ ID NO:1780), S165-S168 (SEQ 
ID NOS:1554, and 1824-1826), S171 (SEQ ID NO:1831), S175-S176 
(SEQ ID NOS:1560 and 1836), S179-S184 (SEQ ID NOS:1839- 
1844), S214-216 (SEQ ID NOS:1845-1847), S219-223 (SEQ ID 
NOS:1852-1856), S227 (SEQ ID NO:1858), S234-245 (SEQ ID 
NOS:1869-1880), S248-S251 (SEQ ID NOS:1883-1886), S264-S265 
(SEQ ID NOS:1900-1901), S268 (SEQ ID NO:1903), S278 (SEQ ID 
NO:1905), S287 (SEQ ID NO:1911), S294-S295 (SEQ ID NOS:1922- 
1923), S315 (SEQ ID NO:1937), S319-322 (SEQ ID NOS:1940- 
1943), S326 (SEQ ID NO:1600), S342 (SEQ ID NO:1962), S365- 
S366 (SEQ ID NOS:1987-1988), S371-S373 (SEQ ID NOS:1558, 
and 1990-1991), S386-S403 (SEQ ID NOS:1559, 2005-2007, 1794, 
2008-2009, 1788, 1787, 1789, 2010-2011, 1791, and 2012-2014), 
RB437 (SEQ ID NO:2164), RB502 (SEQ ID NO:2170), RB452 (SEQ 
ID NO:2173), RB513 (SEQ ID NO:2176), RB464 (SEQ ID NO:2179), 
RB596 (SEQ ID NO:2202), RB569 (SEQ ID NO:2203), and RB570 
(SEQ ID NO:2204). 

75. The method according to claim 67, wherein the Formula 6 sequence 
is selected from the group consisting of sequences S256 (SEQ ID 
NO:1893), S263 (SEQ ID NO:1899), S266 (SEQ ID NO:1902), S284- 
285 (SEQ ID NOS:1909-1910), S515 (SEQ ID NO:2102) and RB426 
(SEQ ID NO:2158). 

76. The method according to claim 67, wherein the Formula 1 sequence 
is selected from the group consisting of sequences 

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556); 
FHENFYDWFVRQVS (SEQ ID NO:1557); 
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558); 
GSLDESFYDWFERQLG (SEQ ID NO:1559); 
GLADEDFYEWFERQLR (SEQ ID NO:1561); 
GLADELFYEWFDRQLS (SEQ ID NO:1562); 
GQLDEDFYEWFDRQLS (SEQ ID NO:1563); 
GQLDEDFYAWFDRQLS (SEQ ID NO:1564); 
GFMDESFYEWFERQLR (SEQ ID NO:1565); 
GFWDESFYAWFERQLR (SEQ ID NO:1566); 
GFMDESFYAWFERQLR (SEQ ID NO:1567); 
GFWDESFYEWFERQLR (SEQ ID NO:1568); 
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2057); and 
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). 

77. The method according to claim 67, wherein the Formula 6 sequence 
is a D8 sequence selected from the group consisting of: 

WLDQEWVQCEVYGRGCPSKK (SEQ ID NO:2227); 
KWLDQEWAWVQCEVYGRGCPSKK (SEQ ID NO:1579); 
KWLDQEWAWVQCEVYGRGCPS (SEQ ID NO:1580); 
SLEEEWAQIQCEIYGRGCRY (SEQ ID NO:1581); 
SLEEEWAQIQCEIWGRGCRY (SEQ ID NO:1582); 
SLEEEWAQIECEVYGRGCPS (SEQ ID NO:1583); and 
SLEEEWAQIECEVWGRGCPS (SEQ ID NO:1584). 

78. The method according to claim 67, wherein the amino acid sequence 
is selected from the group consisting of sequences S432-S433 (SEQ 
ID NOS:2033-2036), S436-S445 (SEQ ID NOS:2037-2056), and 
S456 (SEQ ID NOS:2061-2062). 

79. An amino acid sequence that is an insulin receptor agonist, which 
comprises a subsequence that comprises a sequence that binds to 
Site 1 of the insulin receptor and a subsequence that comprises a 
sequence that binds to Site 2 of the insulin receptor, wherein the 
subsequences are linked C-terminus to N-terminus and the 
subsequences are oriented Site 2 to Site 1, with the proviso that the 
amino acid sequence is not insulin, insulin-like growth factor, or 
fragments thereof. 

80. The amino acid sequence according to claim 79, wherein the Site 1 
sequence consists essentially of a Formula 1 sequence X1X2X3X4X5 
and the Site 2 sequence consists essentially of a Formula 6 
sequence X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 
X77 X78 X79 X80 X81, wherein X1, X2, X4, and X5 are aromatic amino 
acids, and X3 is any polar amino acid, and wherein X62, X65, X66, X68, 
X69, X71, X73, X76, X77, X78, X80 and X81 are any amino acid; X63, X70, 
and X74 are hydrophobic amino acids; X64 is a polar amino acid; X67 
and X75 are aromatic amino acids; and X72 and X79 are cysteines. 

81. The amino acid sequence according to claim 80, wherein X1, X2, and 
X5 are selected from the group consisting of phenylalanine and 
tyrosine, X3 is selected from the group consisting of aspartic acid, 
glutamic acid, glycine and serine, and X4 is selected from group 
consisting of tryptophan, tyrosine and phenylalanine. 

82. The amino acid sequence according to claim 80, wherein X63 is 
selected from the group consisting of leucine, isoleucine, methionine 
and valine; X70 and X74 are selected from group consisting of valine, 
isoleucine, leucine and methionine; X64 is selected from group 
consisting of aspartic acid and glutamic acid; X67 is tryptophan; and 
X75 is selected from group consisting of tyrosine and tryptophan. 

83. The amino acid sequence according to claim 80, wherein the 
Formula 1 sequence X1X2X3X4X5 is FYDWF (SEQ ID NO:1554). 

84. The amino acid sequence according to claim 80, wherein the 
Formula 6 sequence X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 
X74 X75 X76 X77 X78 X79 X80 X81 is WLDQEWAWVQCEVYGRGCPS 
(SEQ ID NO:2056). 

85. The amino acid sequence according to claim 80, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences SEQ ID NOS:1-712 (Figures 1A-10); SEQ ID NOS:1221- 
1243 (Figure 8); and SEQ ID NOS:1596, 1718-1719, 1556, 1560, and 
1720-1776 (Table 2). 

86. The amino acid sequence according to claim 80, wherein the 
Formula 6 sequence is selected from the group consisting of 
sequences SEQ ID NOS:926-1061 (Figures 3A-3E); SEQ ID 
NOS:1244-1253 (Figure 9A); and SEQ ID NOS:1596, 1718-1719, 
1556, 1560, and 1720-1776 (Table 2). 

87. The amino acid sequence according to claim 80, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences S105-S116 (SEQ ID NOS:1791-1805, 1556, and 1806- 
1807), S131 (SEQ ID NO:1820), S137 (SEQ ID NO:1821), S158 
(SEQ ID NO:1780), S165-S168 (SEQ ID NOS:1554, and 1824-1826), 
S171 (SEQ ID NO:1831), S175-S176 (SEQ ID NOS:1560 and 1836), 
S179-S184 (SEQ ID NOS:1839-1844), S214-216 (SEQ ID 
NOS:1845-1847), S219-223 (SEQ ID NOS:1852-1856), S227 (SEQ 
ID NO:1858), S234-245 (SEQ ID NOS:1869-1880), S248-S251 (SEQ 
ID NOS:1883-1886), S264-S265 (SEQ ID NOS:1900-1901), S268 
(SEQ ID NO:1903), S278 (SEQ ID NO:1905), S287 (SEQ ID 
NO:1911), S294-S295 (SEQ ID NOS:1922-1923), S315 (SEQ ID 
NO:1937), S319-322 (SEQ ID NOS:1940-1943), S326 (SEQ ID 
NO:1600), S342 (SEQ ID NO:1962), S365-S366 (SEQ ID NOS:1987- 
1988), S371-S373 (SEQ ID NOS:1558, 1990-1991), S386-S403 
(SEQ ID NOS:1559, 2005-2007, 1794, 2008-2009, 1788, 1787, 1789, 
2010-2011, 1791, and 2012-2014), RB437 (SEQ ID NO:2164), 
RB502 (SEQ ID NO:2170), RB452 (SEQ ID NO:2173), RB513 (SEQ 
ID NO:2176), RB464 (SEQ ID NO:2179), RB596 (SEQ ID NO:2202), 
RB569 (SEQ ID NO:2203), and RB570 (SEQ ID NO:2204). 

88. The amino acid sequence according to claim 80, wherein the 
Formula 6 sequence is selected from the group consisting of 
sequences S256 (SEQ ID NO:1893), S263 (SEQ ID NO:1899), S266 
(SEQ ID NO:1902), S284-285 (SEQ ID NOS:1909-1910), S515 (SEQ 
ID NO:2102), and RB426 (SEQ ID NO:2158). 

89. The amino acid sequence according to claim 80, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences 

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556); 
FHENFYDWFVRQVS (SEQ ID NO:1557); 
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558); 
GSLDESFYDWFERQLG (SEQ ID NO:1559); 
GLADEDFYEWFERQLR (SEQ ID NO:1561); 
GLADELFYEWFDRQLS (SEQ ID NO:1562); 
GQLDEDFYEWFDRQLS (SEQ ID NO:1563); 
GQLDEDFYAWFDRQLS (SEQ ID NO:1564); 
GFMDESFYEWFERQLR (SEQ ID NO:1565); 
GFWDESFYAWFERQLR (SEQ ID NO:1566); 
GFMDESFYAWFERQLR (SEQ ID NO:1567); 
GFWDESFYEWFERQLR (SEQ ID NO:1568); 
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2057); and 
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). 

90. The amino acid sequence according to claim 80, wherein the 
Formula 6 sequence is a D8 sequence selected from the group 
consisting of: 

WLDQEWVQCEVYGRGCPSKK (SEQ ID NO:2227); 
KWLDQEWAWVQCEVYGRGCPSKK (SEQ ID NO:1579); 
KWLDQEWAWVQCEVYGRGCPS (SEQ ID NO:1580); 
SLEEEWAQIQCEIYGRGCRY (SEQ ID NO:1581); 
SLEEEWAQIQCEIWGRGCRY (SEQ ID NO:1582); 
SLEEEWAQIECEVYGRGCPS (SEQ ID NO:1583); and 
SLEEEWAQIECEVWGRGCPS (SEQ ID NO:1584). 

91. The amino acid sequence according to claim 80, wherein the amino 
acid sequence is sequence 539 (SEQ ID NO:2116). 

92. The amino acid sequence according to claim 80, wherein the amino 
acid sequence is selected from the group consisting of sequences 
S429 (SEQ ID NO:2032), S455 (SEQ ID NO:2060), S457-S458 (SEQ 
ID NOS:2063-2064), S467-S468 (SEQ ID NOS:2066-2067), S471 
(SEQ ID NO:2068), S471 (SEQ ID NO:2068), S481-S513 (SEQ ID 
NOS:2069-2101), S517-S520 (SEQ ID NOS:2104-2107), S524 (SEQ 
ID NO:2111), RB539 (SEQ ID NO:2196), RB625-RB626 (SEQ ID 
NOS:2200 and 2199), and RB622 (SEQ ID NO:2201). 

93. The amino acid sequence according to claim 80, wherein the amino 
acid sequence is selected from the group consisting of sequences 
RP27 (SEQ ID NO:2213), RP28 (SEQ ID NO:2214), RP29 (SEQ ID 
NO:2215), RP30 (SEQ ID NO:2216), RP31 (SEQ ID NO:2217), RP32 
(SEQ ID NO:2218), RP33 (SEQ ID NO:2219), RP34 (SEQ ID 
NO:2220), and RP35 (SEQ ID NO:2221). 

94. The amino acid sequence according to claim 80, wherein the amino 
acid sequence is selected from the group consisting of sequences 
D8-6aa-S175 (SEQ ID NO:2121), D8-12aa-S175 (SEQ ID NO:2122), 
D8-6aa-RP15 (SEQ ID NO:2126), and D8-6aa-RP17 (SEQ ID 
NO:2127). 

95. The amino acid sequence according to claim 79, wherein the Site 1 
sequence consists essentially of a Formula 1 sequence X1X2X3X4X5 
and the Site 2 sequence consists essentially of a Formula 4 
sequence X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36X37X38X39 
X40X41, and wherein X1, X2, X4, and X5 are aromatic amino acids, and 
X3 is any polar amino acid, and wherein X22, X25, X26, X28, X29, X30, 
X33, X34, X35, X37, X38, X40 and X41 are any amino acid; X23 is any 
hydrophobic amino acid; X27 is a polar amino acid; X31 is an aromatic 
amino acid; X32 is a small amino acid; and wherein at least one 
cysteine is located at positions X24 through X27 and one at X39 or X40. 

96. The amino acid sequence according to claim 95, wherein X1, X2, and 
X5 are selected from the group consisting of phenylalanine and 
tyrosine, X3 is selected from the group consisting of aspartic acid, 
glutamic acid, glycine and serine, and X4 is selected from group 
consisting of tryptophan, tyrosine and phenylalanine. 

97. The amino acid sequence according to claim 95, wherein X24 and X39 
are cysteines, X23 is selected from leucine, isoleucine, methionine 
and valine; X27 is selected from glutamic acid, aspartic acid, 
asparagine, and glutamine; X31 is tryptophan, X32 is glycine; and X36 
is any aromatic amino acid. 

98. The amino acid sequence according to claim 95, wherein the 
Formula 1 sequence X1X2X3X4X5 is FYDWF (SEQ ID NO:1554). 

99. The amino acid sequence according to claim 95, wherein the 
Formula 4 sequence X22X23X24X25X26X27X28X29X30 
X31X32X33X34X35X36X37X38 X39X40X41 is HLCVLEELFWGASLFGYCSG 
(SEQ ID NO:1576). 

100. The amino acid sequence according to claim 95, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences SEQ ID NOS:1-712 (Figures 1A-10); SEQ ID NOS:1221- 
1243 (Figure 8); and SEQ ID NOS:1596, 1718-1719, 1556, 1560, and 
1720-1776 (Table 2). 

101. The amino acid sequence according to claim 95, wherein the 
Formula 4 sequence is selected from the group consisting of 
sequences S262 (SEQ ID NO:1898), S282-S283 (SEQ ID 
NOS:1907-1908), S331 (SEQ ID NO:1949), RB505M (SEQ ID 
NO:2160), RB446 (SEQ ID NO:2180), and RB505 (SEQ ID 
NO:2191). 

102. The amino acid sequence according to claim 95, wherein the 
Formula 4 sequence is selected from the group consisting of 
sequences RB517M (SEQ ID NO:2161), RB515 (SEQ ID NO:2162), 
and RB510 (SEQ ID NO:2163). 

103. The amino acid sequence according to claim 95, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences S105-S116 (SEQ ID NOS:1791-1805, 1556, and 1806- 
1807), S131 (SEQ ID NO:1820), S137 (SEQ ID NO:1821), S158 
(SEQ ID NO:1780), S165-S168 (SEQ ID NOS:1554, and 1824-1826), 
S171 (SEQ ID NO:1831), S175-S176 (SEQ ID NOS:1560 and 1836), 
S179-S184 (SEQ ID NOS:1839-1844), S214-216 (SEQ ID 
NOS:1845-1847), S219-223 (SEQ ID NOS:1852-1856), S227 (SEQ 
ID NO:1858), S234-245 (SEQ ID NOS:1869-1880), S248-S251 (SEQ 
ID NOS:1883-1886), S264-S265 (SEQ ID NOS:1900-1901), S268 
(SEQ ID NO:1903), S278 (SEQ ID NO:1905), S287 (SEQ ID 
NO:1911), S294-S295 (SEQ ID NOS:1922-1923), S315 (SEQ ID 
NO:1937), S319-322 (SEQ ID NOS:1940-1943), S326 (SEQ ID 
NO:1600), S342 (SEQ ID NO:1962), S365-S366 (SEQ ID NOS:1987- 
1988), S371-S373 (SEQ ID NOS:1558 and 1990-1991), S386-S403 
(SEQ ID NOS:1559, 2005-2007, 1794, 2008-2009, 1788, 1787, 1789, 
2010-2011, 1791, and 2012-2014), RB437 (SEQ ID NO:2164), 
RB502 (SEQ ID NO:2170), RB452 (SEQ ID NO:2173), RB513 (SEQ 
ID NO:2176), RB464 (SEQ ID NO:2179), RB596 (SEQ ID NO:2202), 
RB569 (SEQ ID NO:2203), and RB570 (SEQ ID NO:2204). 

104. The amino acid sequence according to claim 95, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences 

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556); 
FHENFYDWFVRQVS (SEQ ID NO:1557); 
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558); 
GSLDESFYDWFERQLG (SEQ ID NO:1559); 
GLADEDFYEWFERQLR (SEQ ID NO:1561); 
GLADELFYEWFDRQLS (SEQ ID NO:1562); 
GQLDEDFYEWFDRQLS (SEQ ID NO:1563); 
GQLDEDFYAWFDRQLS (SEQ ID NO:1564); 
GFMDESFYEWFERQLR (SEQ ID NO:1565); 
GFWDESFYAWFERQLR (SEQ ID NO:1566); 
GFMDESFYAWFERQLR (SEQ ID NO:1567); 
GFWDESFYEWFERQLR (SEQ ID NO:1568); 
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2057); and 
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). 

105. The amino acid sequence according to claim 95, wherein the amino 
acid sequence is selected from the group consisting of sequences 
F8-6aa-RP9 (SEQ ID NO:2119), F8-12aa-RP9 (SEQ ID NO:2120), 
F8-6aa-S175 (SEQ ID NO:2123), F8-12aa-S175 (SEQ ID NO:2124), 
S516 (SEQ ID NO:2103), S521 (SEQ ID NO:2108), and S522 (SEQ 
ID NO:2109). 

106. An amino acid sequence that is an insulin receptor agonist, which 
comprises a plurality of subsequences that each comprise a 
sequence that binds to Site 1 of insulin receptor, with the proviso that 
the amino acid sequence is not insulin, insulin-like growth factor, or 
fragments thereof. 

107. The amino acid sequence according to claim 106, wherein the Site 1 
sequence consists essentially of a Formula 1 sequence X1X2X3X4X5, 
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any 
polar amino acid. 

108. The amino acid sequence according to claim 107, wherein X1, X2, 
and X5 are selected from the group consisting of phenylalanine and 
tyrosine, X3 is selected from the group consisting of aspartic acid, 
glutamic acid, glycine and serine, and X4 is selected from group 
consisting of tryptophan, tyrosine and phenylalanine. 

109. The amino acid sequence according to claim 107, wherein the 
Formula 1 sequence X1X2X3X4X5 is FYDWF (SEQ ID NO:1554). 

110. The amino acid sequence according to claim 107, wherein the 
Formula 1 sequence is selected from the group consisting of SEQ ID 
NOS:1-712 (Figures 1A-10); SEQ ID NOS:1221-1243 (Figure 8); and 
SEQ ID NOS:1596, 1718-1719, 1556, 1560, and 1720-1776 (Table 2). 

111. The amino acid sequence according to claim 107, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences S105-S116 (SEQ ID NOS:1791-1805, 1556, and 1806- 
1807), S131 (SEQ ID NO:1820), S137 (SEQ ID NO:1821), S158 
(SEQ ID NO:1780), S165-S168 (SEQ ID NOS:1554, and 1824-1826), 
S171 (SEQ ID NO:1831), S175-S176 (SEQ ID NOS:1560 and 1836), 
S179-S184 (SEQ ID NOS:1839-1844), S214-216 (SEQ ID 
NOS:1845-1847), S219-223 (SEQ ID NOS:1852-1856), S227 (SEQ 
ID NO:1858), S234-245 (SEQ ID NOS:1869-1880), S248-S251 (SEQ 
ID NOS:1883-1886), S264-S265 (SEQ ID NOS:1900-1901), S268 
(SEQ ID NO:1903), S278 (SEQ ID NO:1905), S287 (SEQ ID 
NO:1911), S294-S295 (SEQ ID NOS:1922-1923), S315 (SEQ ID 
NO:1937), S319-322 (SEQ ID NOS:1940-1943), S326 (SEQ ID 
NO:1600), S342 (SEQ ID NO:1962), S365-S366 (SEQ ID 
NOS:1987-1988), S371-S373 (SEQ ID NOS:1558, and 1990-1991), 
S386-S403 (SEQ ID NOS:1559, 2005-2007, 1794, 2008-2009, 1788, 
1787, 1789, 2010-2011, 1791, and 2012-2014), RB437 (SEQ ID 
NO:2164), RB502 (SEQ ID NO:2170), RB452 (SEQ ID NO:2173), 
RB513 (SEQ ID NO:2176), RB464 (SEQ ID NO:2179), RB596 (SEQ 
ID NO:2202), RB569 (SEQ ID NO:2203), and RB570 (SEQ ID 
NO:2204). 

112. The amino acid sequence according to claim 107, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences 

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556); 
FHENFYDWFVRQVS (SEQ ID NO:1557); 
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558); 
GSLDESFYDWFERQLG (SEQ ID NO:1559); 
GLADEDFYEWFERQLR (SEQ ID NO:1561); 
GLADELFYEWFDRQLS (SEQ ID NO:1562); 
GQLDEDFYEWFDRQLS (SEQ ID NO:1563); 
GQLDEDFYAWFDRQLS (SEQ ID NO:1564); 
GFMDESFYEWFERQLR (SEQ ID NO:1565); 
GFWDESFYAWFERQLR (SEQ ID NO:1566); 
GFMDESFYAWFERQLR (SEQ ID NO:1567); 
GFWDESFYEWFERQLR (SEQ ID NO:1568); 
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2057); and 
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). 

113. The amino acid sequence according to claim 107, wherein the amino 
acid sequence is selected from the group consisting of sequences 
521 (SEQ ID NO:2112) and 535 (SEQ ID NO:2113). 

114. The amino acid sequence according to claim 107, wherein the amino 
acid sequence is selected from the group consisting of sequences 
434 (SEQ ID NO:2138), 436 (SEQ ID NO:2139), 427 (SEQ ID 
NO:2141), 435 (SEQ ID NO:2142), 439 (SEQ ID NO:2143), 449 
(SEQ ID NO:2144), and 463 (SEQ ID NO:2146). 

115. The amino acid sequence according to claim 107, wherein the amino 
acid sequence is selected from the group consisting of sequences 
S128 (SEQ ID NOS:1817-1818), S145 (SEQ ID NOS:1822-1823), 
S169-S170 (SEQ ID NOS:1827-1830), S172 (SEQ ID NOS:1832- 
1833), S218 (SEQ ID NOS:1850-1851), S228 (SEQ ID NOS:1859- 
1860), S231-232 (SEQ ID NOS:1863-1866), S253 (SEQ ID 
NOS:1889-1890), S267 (SEQ ID NO:), S290-S293 (SEQ ID 
NOS:1914-1921), S300-S301 (SEQ ID NOS:1924-1927), S312 (SEQ 
ID NOS:1933-1934), S325 (SEQ ID NOS:1944-1945), S329 (SEQ ID 
NOS:1947-1948), S332-S337 (SEQ ID NOS:1950-1961), S349-S354 
(SEQ ID NOS:1965-1976), S359-S363 (SEQ ID NOS:1977-1986), 
S374-S376 (SEQ ID NOS:1992-1996), S378-S381 (SEQ ID 
NOS:1997-2004), S414-S418 (SEQ ID NOS:2015-2024), S420 (SEQ 
ID NOS:2027-2028), RB463 (SEQ ID NO:2165), RB439 (SEQ ID 
NO:2166), RB436 (SEQ ID NO:2167), RB449 (SEQ ID NO:2168), 
RB508M-RB509M (SEQ ID NOS:2171-2172), RB508-RB509 (SEQ 
ID NOS:2189-2190), RB521 (SEQ ID NO:2193), and RB535 (SEQ ID 
NO:2194). 

116. An amino acid sequence that is an insulin receptor agonist, which 
comprises a subsequence that comprises a sequence that binds to 
Site 1 of insulin receptor and a subsequence that comprises a 
sequence that binds to Site 2 of insulin receptor, wherein the 
subsequences are linked C-terminus to C-terminus or N-terminus to 
N-terminus, with the proviso that the amino acid sequence is not 
insulin, insulin-like growth factor, or fragments thereof. 

117. The amino acid sequence according to claim 116, wherein the Site 1 
sequence consists essentially of a Formula 1 sequence X1X2X3X4X5 
and the Site 2 sequence consists essentially of a Formula 6 
sequence X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 
X77 X78 X79 X80 X81, and wherein X1, X2, X4, and X5 are aromatic 
amino acids, and X3 is any polar amino acid, and wherein X62, X65, 
X66, X68, X69, X71, X73, X76, X77, X78, X80 and X81 are any amino acid; 
X63, X70, and X74 are hydrophobic amino acids; X64 is a polar amino 
acid; X67 and X75 are aromatic amino acids; and X72 and X79 are 
cysteines. 

118. The amino acid sequence according to claim 117, wherein X1, X2, 
and X5 are selected from the group consisting of phenylalanine and 
tyrosine, X3 is selected from the group consisting of aspartic acid, 
glutamic acid, glycine and serine, and X4 is selected from group 
consisting of tryptophan, tyrosine and phenylalanine. 

119. The amino acid sequence according to claim 117, wherein X63 is 
selected from the group consisting of leucine, isoleucine, methionine 
and valine; X70 and X74 are selected from group consisting of valine, 
isoleucine, leucine and methionine; X64 is selected from group 
consisting of aspartic acid and glutamic acid; X67 is tryptophan; and 
X75 is selected from group consisting of tyrosine and tryptophan. 

120. The amino acid sequence according to claim 117, wherein the 
Formula 1 sequence X1X2X3X4X5 is FYDWF (SEQ ID NO:1554). 

121. The amino acid sequence according to claim 117, wherein the 
Formula 6 sequence X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 
X74 X75 X76 X77 X78 X79 X80 X81 is WLDQEWAWVQCEVYGRGCPS 
(SEQ ID NO:2056). 

122. The amino acid sequence according to claim 117, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences SEQ ID NOS:1-712 (Figures 1A-10); SEQ ID NOS:1221- 
1243 (Figure 8); and SEQ ID NOS:1596, 1718-1719, 1556, 1560, and 
1720-1776 (Table 2). 

123. The amino acid sequence according to claim 117, wherein the 
Formula 6 sequence is selected from the group consisting of 
sequences SEQ ID NOS:926-1061 (Figures 3A-3E); SEQ ID 
NOS:1244-1253 (Figure 9A); and SEQ ID NOS:1596, 1718-1719, 
1556, 1560, and 1720-1776 (Table 2). 

124. The amino acid sequence according to claim 117, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences S105-S116 (SEQ ID NOS:1791-1805, 1556, and 1806- 
1807), S131 (SEQ ID NO:1820), S137 (SEQ ID NO:1821), S158 
(SEQ ID NO:1780), S165-S168 (SEQ ID NOS:1554, and 1824-1826), 
S171 (SEQ ID NO:1831), S175-S176 (SEQ ID NOS:1560 and 1836), 
S179-S184 (SEQ ID NOS:1839-1844), S214-216 (SEQ ID 
NOS:1845-1847), S219-223 (SEQ ID NOS:1852-1856), S227 (SEQ 
ID NO:1858), S234-245 (SEQ ID NOS:1869-1880), S248-S251 (SEQ 
ID NOS:1883-1886), S264-S265 (SEQ ID NOS:1900-1901), S268 
(SEQ ID NO:1903), S278 (SEQ ID NO:1905), S287 (SEQ ID 
NO:1911), S294-S295 (SEQ ID NOS:1922-1923), S315 (SEQ ID 
NO:1937), S319-322 (SEQ ID NOS:1940-1943), S326 (SEQ ID 
NO:1600), S342 (SEQ ID NO:1962), S365-S366 (SEQ ID NOS:1987- 
1988), S371-S373 (SEQ ID NOS:1558, and 1990-1991), S386-S403 
(SEQ ID NOS:1559, 2005-2007, 1794, 2008-2009, 1788, 1787, 1789, 
2010-2011, 1791, and 2012-2014), RB437 (SEQ ID NO:2164), 
RB502 (SEQ ID NO:2170), RB452 (SEQ ID NO:2173), RB513 (SEQ 
ID NO:2176), RB464 (SEQ ID NO:2179), RB596 (SEQ ID NO:2202), 
RB569 (SEQ ID NO:2203), and RB570 (SEQ ID NO:2204). 

125. The amino acid sequence according to claim 117, wherein the 
Formula 6 sequence is selected from the group consisting of 
sequences S256 (SEQ ID NO:1893), S263 (SEQ ID NO:1899), S266 
(SEQ ID NO:1902), S284-285 (SEQ ID NOS:1909-1910), S515 
(SEQ ID NO:2102), and RB426 (SEQ ID NO:2158). 

126. The amino acid sequence according to claim 117, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences 

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556); 
FHENFYDWFVRQVS (SEQ ID NO:1557); 
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558); 
GSLDESFYDWFERQLG (SEQ ID NO:1559); 
GLADEDFYEWFERQLR (SEQ ID NO:1561); 
GLADELFYEWFDRQLS (SEQ ID NO:1562); 
GQLDEDFYEWFDRQLS (SEQ ID NO:1563); 
GQLDEDFYAWFDRQLS (SEQ ID NO:1564); 
GFMDESFYEWFERQLR (SEQ ID NO:1565); 
GFWDESFYAWFERQLR (SEQ ID NO:1566); 
GFMDESFYAWFERQLR (SEQ ID NO:1567); 
GFWDESFYEWFERQLR (SEQ ID NO:1568); 
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2057); and 
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). 
127. The amino acid sequence according to claim 117, wherein the 
Formula 6 sequence is a D8 sequence selected from the group 
consisting of: 

WLDQEWVQCEVYGRGCPSKK (SEQ ID NO:2227); 
KWLDQEWAWVQCEVYGRGCPSKK (SEQ ID NO:1579); 
KWLDQEWAWVQCEVYGRGCPS (SEQ ID NO:1580); 
SLEEEWAQIQCEIYGRGCRY (SEQ ID NO:1581); 
SLEEEWAQIQCEIWGRGCRY (SEQ ID NO: 1582); 
SLEEEWAQIECEVYGRGCPS (SEQ ID NO:1583); and 
SLEEEWAQIECEVWGRGCPS (SEQ ID NO:1584). 

128. The amino acid sequence according to claim 117, wherein the amino 
acid sequence is selected from the group consisting of sequences 
S432-S433 (SEQ ID NOS:2033-2036), S436-S445 (SEQ ID 
NOS:2037-2056), and S456 (SEQ ID NOS:2061-2062). 

129. An amino acid sequence that is an insulin receptor antagonist, which 
comprises a subsequence that comprises a sequence that binds to 
Site 1 of insulin receptor and a subsequence that comprises a 
sequence that binds to Site 2 of insulin receptor, wherein the 
subsequences are linked C-terminus to N-terminus and the 
subsequences are oriented Site 1 to Site 2, with the proviso that the 
amino acid sequence is not insulin, insulin-like growth factor, or 
fragments thereof. 
130. The amino acid sequence according to claim 129, wherein the Site 1 
sequence consists essentially of a Formula 1 sequence X1X2X3X4X5 
and the Site 2 sequence consists essentially of a Formula 6 
sequence X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 
X77 X78 X79 X80 X81, and wherein X1, X2, X4, and X5 are aromatic 
amino acids, and X3 is any polar amino acid, and wherein X62, X65, 
X66, X68, X69, X71, X73, X76, X77, X78, X80 and X81 are any amino acid; 
X63, X70, and X74 are hydrophobic amino acids; X64 is a polar amino 
acid; X67 and X75 are aromatic amino acids; and X72 and X79 are 
cysteines. 

131. The amino acid sequence according to claim 130, wherein X1, X2, 
and X5 are selected from the group consisting of phenylalanine and 
tyrosine, X3 is selected from the group consisting of aspartic acid, 
glutamic acid, glycine and serine, and X4 is selected from the group 
consisting of tryptophan, tyrosine and phenylalanine. 

132. The amino acid sequence according to claim 130, wherein X63 is 
selected from the group consisting of leucine, isoleucine, methionine 
and valine; X70 and X74 are selected from the group consisting of valine, 
isoleucine, leucine and methionine; X64 is selected from the group 
consisting of aspartic acid and glutamic acid; X67 is tryptophan; and 
X75 is selected from the group consisting of tyrosine and tryptophan. 

133. The amino acid sequence according to claim 130, wherein the 
Formula 1 sequence X1X2X3X4X5 is FYDWF (SEQ ID NO:1554). 

134. The amino acid sequence according to claim 130, wherein the 
Formula 6 sequence X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 
X74 X75 X76 X77 X78 X79 X80 X81 is WLDQEWAWVQCEVYGRGCPS 
(SEQ ID NO:2056). 
135. The amino acid sequence according to claim 130, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences SEQ ID NOS:1-712 (Figures 1A-10); SEQ ID NOS: 1221- 
1243 (Figure 8); and SEQ ID NO:1596, 1718-1719, 1556, 1560, and 
1720-1776 (Table 2). 

136. The amino acid sequence according to claim 130, wherein the 
Formula 6 sequence is selected from the group consisting of 
sequences SEQ ID NO:926-1061 (Figures 3A-3E); SEQ ID NO: 1244- 
1253 (Figure 9A); and SEQ ID NO:1596, 1718-1719, 1556, 1560, and 
1720-1776 (Table 2). 

137. The amino acid sequence according to claim 130, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences S105-S116 (SEQ ID NOS:1791-1805, 1556, and 1806- 
1807), S131 (SEQ ID NO:1820), S137 (SEQ ID NO:1821), S158 
(SEQ ID NO:1780), S165-S168 (SEQ ID NOS:1554, and 1824-1826), 
S171 (SEQ ID NO:1831), S175-S176 (SEQ ID NOS:1560 and 1836), 
S179-S184 (SEQ ID NOS:1839-1844), S214-216 (SEQ ID 
NOS:1845-1847), S219-223 (SEQ ID NOS:1852-1856), S227 (SEQ 
ID NO:1858), S234-245 (SEQ ID NOS:1869-1880), S248-S251 (SEQ 
ID NOS:1883-1886), S264-S265 (SEQ ID NOS:1900-1901), S268 
(SEQ ID NO:1903), S278 (SEQ ID NO:1905), S287 (SEQ ID 
NO:1911), S294-S295 (SEQ ID NOS:1922-1923), S315 (SEQ ID 
NO:1937), S319-322 (SEQ ID NOS:1940-1943), S326 (SEQ ID 
NO:1600), S342 (SEQ ID NO:1962), S365-S366 (SEQ ID NOS:1987- 
1988), S371-S373 (SEQ ID NOS: 1558, 1990-1991), S386-S403 
(SEQ ID NOS:1559, 2005-2007, 1794, 2008-2009, 1788, 1787, 1789, 
2010-2011, 1791, and 2012-2014), RB437 (SEQ ID NO:2164), 
RB502 (SEQ ID NO:2170), RB452 (SEQ ID NO:2173), RB513 (SEQ 
ID NO:2176), RB464 (SEQ ID NO:2179), RB596 (SEQ ID NO:2202), 
RB569 (SEQ ID NO:2203), and RB570 (SEQ ID NO:2204). 
138. The amino acid sequence according to claim 130, wherein the 
Formula 6 sequence is selected from the group consisting of 
sequences S256 (SEQ ID NO:1893), S263 (SEQ ID NO:1899), S266 
(SEQ ID NO:1902), S284-285 (SEQ ID NOS:1909-1910), S515 (SEQ 
ID NO:2102), and RB426 (SEQ ID NO:2158). 

139. The amino acid sequence according to claim 130, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences 

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO: 1556); 
FHENFYDWFVRQVS (SEQ ID NO:1557); 
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558); 
GSLDESFYDWFERQLG (SEQ ID NO:1559); 
GLADEDFYEWFERQLR (SEQ ID NO:1561); 
GLADELFYEWFDRQLS (SEQ ID NO:1562); 
GQLDEDFYEWFDRQLS (SEQ ID NO:1563); 
GQLDEDFYAWFDRQLS (SEQ ID NO:1564); 
GFMDESFYEWFERQLR (SEQ ID NO:1565); 
GFWDESFYAWFERQLR (SEQ ID NO:1566); 
GFMDESFYAWFERQLR (SEQ ID NO:1567); 
GFWDESFYEWFERQLR (SEQ ID NO:1568); 
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2057); and 
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). 
140. The amino acid sequence according to claim 130, wherein the 
Formula 6 sequence is a D8 sequence selected from the group 
consisting of: 

WLDQEWVQCEVYGRGCPSKK (SEQ ID NO:2227); 
KWLDQEWAWVQCEVYGRGCPSKK (SEQ ID NO:1579); 
KWLDQEWAWVQCEVYGRGCPS (SEQ ID NO:1580); 
SLEEEWAQIQCEIYGRGCRY (SEQ ID NO:1581); 
SLEEEWAQIQCEIWGRGCRY (SEQ ID NO:1582); 
SLEEEWAQIECEVYGRGCPS (SEQ ID NO: 1583); and 
SLEEEWAQIECEVWGRGCPS (SEQ ID NO:1584). 

141. The amino acid sequence according to claim 130, wherein the amino 
acid sequence is selected from the group consisting of sequences 
537-538 (SEQ ID NOS:2114-2115). 

142. The amino acid sequence according to claim 130, wherein the amino 
acid sequence is selected from the group consisting of sequences 
S425 (SEQ ID NO:2031), S454 (SEQ ID NOS:2058-2059), S459 
(SEQ ID NO:2065), and RB537-RB538 (SEQ ID NOS:2197-2198). 

143. The amino acid sequence according to claim 129, wherein the Site 1 
sequence consists essentially of a Formula 1 sequence X1X2X3X4X5 
and the Site 2 sequence consists essentially of a Formula 4 
sequence X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36X37X38X39 
X40X41, and wherein X1, X2, X4, and X5 are aromatic amino acids, and 
X3 is any polar amino acid, and wherein X22, X25, X26, X28, X29, X30, 
X33, X34, X35, X37, X38, X40 and X41 are any amino acid; X23 is any 
hydrophobic amino acid; X27 is a polar amino acid; X31 is an aromatic 
amino acid; X32 is a small amino acid; and wherein at least one 
cysteine is located at positions X24 through X27 and one at X39 or X40. 
144. The amino acid sequence according to claim 143, wherein X1, X2, 
and X5 are selected from the group consisting of phenylalanine and 
tyrosine, X3 is selected from the group consisting of aspartic acid, 
glutamic acid, glycine and serine, and X4 is selected from the group 
consisting of tryptophan, tyrosine and phenylalanine. 

145. The amino acid sequence according to claim 143, wherein X24 and 
X39 are cysteines, X23 is selected from leucine, isoleucine, methionine 
and valine; X27 is selected from glutamic acid, aspartic acid, 
asparagine, and glutamine; X31 is tryptophan, X32 is glycine; and X36 
is any aromatic amino acid. 

146. The amino acid sequence according to claim 143, wherein the 
Formula 1 sequence X1X2X3X4X5 is FYDWF (SEQ ID NO:1554). 

147. The amino acid sequence according to claim 143, wherein the 
Formula 4 sequence X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36 
X37X38X39X40X41 is HLCVLEELFWGASLFGYCSG (SEQ ID NO:1576). 

148. The amino acid sequence according to claim 143, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences SEQ ID NOS:1-712 (Figures 1A-10); SEQ ID NOS:1221- 
1243 (Figure 8); and SEQ ID NOS:1596, 1718-1719, 1556, 1560, and 
1720-1776 (Table 2). 

149. The amino acid sequence according to claim 143, wherein the 
Formula 4 sequence is selected from the group consisting of 
sequences SEQ ID NOS:713-925 (Figures 2A-2E); SEQ ID 
NOS:1254-1261 (Figure 9B); and SEQ ID NOS:1596, 1718-1719, 
1556, 1560, and 1720-1776 (Table 2). 
150. The amino acid sequence according to claim 143, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences S105-S116 (SEQ ID NOS:1791-1805), S131 (SEQ ID 
NO:1820), S137 (SEQ ID NO:1821), S158 (SEQ ID NO:1780), 
S165-S168 (SEQ ID NOS:1554, and 1824-1826), S171 (SEQ ID NO:1831), 
S175-S176 (SEQ ID NOS:1560 and 1836), S179-S184 (SEQ ID 
NOS:1839-1844), S214-216 (SEQ ID NOS:1845-1847), S219-223 
(SEQ ID NOS:1852-1856), S227 (SEQ ID NO:1858), S234-245 (SEQ 
ID NOS:1869-1880), S248-S251 (SEQ ID NOS:1883-1886), 
S264-S265 (SEQ ID NOS:1900-1901), S268 (SEQ ID NO:1903), S278 
(SEQ ID NO:1905), S287 (SEQ ID NO:1911), S294-S295 (SEQ ID 
NOS:1922-1923), S315 (SEQ ID NO:1937), S319-322 (SEQ ID 
NOS:1940-1943), S326 (SEQ ID NO:1600), S342 (SEQ ID NO:1962), 
S365-S366 (SEQ ID NOS:1987-1988), S371-S373 (SEQ ID 
NOS:1558, and 1990-1991), S386-S403 (SEQ ID NOS:1559, 
2005-2007, 1794, 2008-2009, 1788, 1787, 1789, 2010-2011, 1791, and 
2012-2014), RB437 (SEQ ID NO:2164), RB502 (SEQ ID NO:2170), 
RB452 (SEQ ID NO:2173), RB513 (SEQ ID NO:2176), RB464 (SEQ 
ID NO:2179), RB596 (SEQ ID NO:2202), RB569 (SEQ ID NO:2203), 
and RB570 (SEQ ID NO:2204). 

151. The amino acid sequence according to claim 143, wherein the 
Formula 4 sequence is selected from the group consisting of 
sequences S262 (SEQ ID NO:1898), S282-S283 (SEQ ID 
NOS:1907-1908), S331 (SEQ ID NO:1949), RB505M (SEQ ID 
NO:2160), RB446 (SEQ ID NO:2180), and RB505 (SEQ ID 
NO:2191). 

152. The amino acid sequence according to claim 143, wherein the 
Formula 4 sequence is selected from the group consisting of 
sequences RB517M (SEQ ID NO:2161), RB515 (SEQ ID NO:2162), 
and RB510 (SEQ ID NO:2163). 
153. The amino acid sequence according to claim 143, wherein the 
Formula 1 sequence is selected from the group consisting of 
sequences 

H2C/D117: FHENFYDWFVRQVSKK (SEQ ID NO:1556); 
FHENFYDWFVRQVS (SEQ ID NO: 1557); 
RP9: GSLDESFYDWFERQLGKK (SEQ ID NO:1558); 
GSLDESFYDWFERQLG (SEQ ID NO:1559); 
GLADEDFYEWFERQLR (SEQ ID NO:1561); 
GLADELFYEWFDRQLS (SEQ ID NO:1562); 
GQLDEDFYEWFDRQLS (SEQ ID NO: 1563); 
GQLDEDFYAWFDRQLS (SEQ ID NO:1564); 
GFMDESFYEWFERQLR (SEQ ID NO:1565); 
GFWDESFYAWFERQLR (SEQ ID NO: 1566); 
GFMDESFYAWFERQLR (SEQ ID NO: 1567); 
GFWDESFYEWFERQLR (SEQ ID NO: 1568); 
RP15: SQAGSAFYAWFDQVLRTV (SEQ ID NO:2057); and 
S175: GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). 

154. The amino acid sequence according to claim 143, wherein the amino 
acid sequence is selected from the group consisting of sequences 
431-433 (SEQ ID NOS:2135-2137). 

155. The amino acid sequence according to claim 143, wherein the amino 
acid sequence is selected from the group consisting of sequences 
RB431-RB433 (SEQ ID NOS:2184, 2186, and 2188). 

156. A pharmaceutical composition comprising the amino acid sequence 
according to claim 79, and a physiologically acceptable carrier, 
excipient or diluent. 

157. A pharmaceutical composition comprising the amino acid sequence 
according to claim 106, and a physiologically acceptable carrier, 
excipient or diluent. 
158. A pharmaceutical composition comprising the amino acid sequence 
according to claim 116, and a physiologically acceptable carrier, 
excipient or diluent. 

159. A pharmaceutical composition comprising the amino acid sequence 
according to claim 129, and a physiologically acceptable carrier, 
excipient or diluent. 

160. A method of treating diabetes comprising administering to an 
individual in need of treatment a therapeutically effective amount of 
the pharmaceutical composition according to claim 156, thereby 
treating the diabetes. 

161. A method of treating diabetes comprising administering to an 
individual in need of treatment a therapeutically effective amount of 
the pharmaceutical composition according to claim 157, thereby 
treating the diabetes. 

162. A method of treating diabetes comprising administering to an 
individual in need of treatment a therapeutically effective amount of 
the pharmaceutical composition according to claim 158, thereby 
treating the diabetes. 

163. A method of treating insulin shock comprising administering to an 
individual in need of treatment a therapeutically effective amount of 
the pharmaceutical composition according to claim 159, thereby 
treating the insulin shock. 

164. A method of identifying an insulin agonist comprising: 

1) producing a secondary library comprising peptides derived from 
the amino acid sequence according to claim 79; 

2) screening the library peptides for binding to insulin receptor; and 

3) screening the insulin-binding peptides from (2) for agonist activity 
for insulin receptor, wherein agonist activity indicates identification of 
an insulin receptor agonist. 
165. A method of identifying an insulin agonist comprising: 

1) producing a secondary library comprising peptides derived from 
the amino acid sequence according to claim 106; 

2) screening the library peptides for binding to insulin receptor; and 

3) screening the insulin-binding peptides from (2) for agonist activity 
for insulin receptor, wherein agonist activity indicates identification of 
an insulin receptor agonist. 

166. A method of identifying an insulin agonist comprising: 

1) producing a secondary library comprising peptides derived from 
the amino acid sequence according to claim 116; 

2) screening the library peptides for binding to insulin receptor; and 

3) screening the insulin-binding peptides from (2) for agonist activity 
for insulin receptor, wherein agonist activity indicates identification of 
an insulin receptor agonist. 

167. A method of identifying an insulin antagonist comprising: 

1) producing a secondary library comprising peptides derived from 
the amino acid sequence according to claim 129; 

2) screening the library peptides for binding to insulin receptor; and 

3) screening the insulin-binding peptides from (2) for antagonist 
activity for insulin receptor, wherein antagonist activity indicates 
identification of an insulin receptor antagonist. 

168. A method of increasing insulin receptor activity in mammalian cells 
comprising administering to said cells an amino acid sequence 
comprising a sequence selected from the group consisting of 
SLEEEWAQVECEVYGRGCPSGSLDESFYDWFERQLG (S519; SEQ 
ID NO:2033) and SIEEEWAQIKCDVWGRGCPPGLLDESFYHWFDRQLR 
(S520; SEQ ID NO:2034), and a physiologically acceptable carrier, 
excipient, or diluent, in an amount sufficient to increase insulin 
activity. 
169. An amino acid sequence that is an insulin receptor agonist, which 
comprises a sequence selected from the group consisting of 
SLEEEWAQVECEVYGRGCPSGSLDESFYDWFERQLG (S519; 
SEQ ID NO:2033) and 
SIEEEWAQIKCDVWGRGCPPGLLDESFYHWFDRQLR (S520; SEQ 
ID NO:2034). 

170. A pharmaceutical composition comprising the amino acid 
sequence according to claim 169, and a physiologically acceptable 
carrier, excipient or diluent. 

171. A method of treating diabetes comprising administering to an 
individual in need of treatment a therapeutically effective amount of 
the pharmaceutical composition according to claim 170, thereby 
treating the diabetes. 

172. A method of identifying an insulin agonist comprising: 

1) producing a secondary library comprising peptides derived from 
the amino acid sequence according to claim 169; 

2) screening the library peptides for binding to insulin receptor; and 

3) screening the insulin-binding peptides from (2) for agonist activity 
for insulin receptor, wherein agonist activity indicates identification of 
an insulin receptor agonist. 
173. A method of increasing insulin-like growth factor receptor activity in 
mammalian cells comprising administering to said cells an amino acid 
sequence in an amount sufficient to increase insulin-like growth factor 
activity, wherein the amino acid sequence comprises a subsequence 
that comprises a sequence that binds to Site 1 of insulin-like growth 
factor receptor and a subsequence that comprises a sequence that 
binds to Site 2 of insulin-like growth factor receptor, and wherein the 
subsequences are linked C-terminus to N-terminus and the 
subsequences are oriented Site 2 to Site 1, with the proviso that the 
amino acid sequence is not insulin, insulin-like growth factor, or 
fragments thereof. 

174. The method according to claim 173, wherein the Site 1 sequence 
consists essentially of a Formula 1 sequence X1X2X3X4X5 and the Site 
2 sequence consists essentially of a Formula 6 sequence X62 X63 X64 
X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 X80 X81, and 
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any 
polar amino acid, and wherein X62, X65, X66, X68, X69, X71, X73, X76, X77, 
X78, X80 and X81 are any amino acid; X63, X70, and X74 are 
hydrophobic amino acids; X64 is a polar amino acid; X67 and X75 are 
aromatic amino acids; and X72 and X79 are cysteines. 

175. An amino acid sequence that is an insulin-like growth factor receptor 
agonist, which comprises a subsequence that comprises a sequence 
that binds to Site 1 of the insulin-like growth factor receptor and a 
subsequence that comprises a sequence that binds to Site 2 of the 
insulin-like growth factor receptor, wherein the subsequences are 
linked C-terminus to N-terminus and the subsequences are oriented 
Site 2 to Site 1, with the proviso that the amino acid sequence is not 
insulin, insulin-like growth factor, or fragments thereof. 
176. The amino acid sequence of claim 175, wherein the Site 1 sequence 
consists essentially of a Formula 1 sequence X1X2X3X4X5 and the Site 
2 sequence consists essentially of a Formula 6 sequence X62 X63 X64 
X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 X80 X81, and 
wherein X1, X2, X4, and X5 are aromatic amino acids, and X3 is any 
polar amino acid, and wherein X62, X65, X66, X68, X69, X71, X73, X76, X77, 
X78, X80 and X81 are any amino acid; X63, X70, and X74 are 
hydrophobic amino acids; X64 is a polar amino acid; X67 and X75 are 
aromatic amino acids; and X72 and X79 are cysteines. 

177. An amino acid sequence that binds to insulin-like growth factor 
receptor, which comprises a sequence selected from the group 
consisting of a sequence that binds to Site 1 of the insulin-like growth 
factor receptor and a sequence that binds to Site 2 of the insulin-like 
growth factor receptor, with the proviso that the amino acid sequence 
is not insulin, insulin-like growth factor, or fragments thereof. 

178. The amino acid sequence of claim 177, wherein the Site 1 sequence 
consists essentially of a Formula 1 sequence X1X2X3X4X5, wherein 
X1, X2, X4, and X5 are aromatic amino acids, and X3 is any polar 
amino acid. 

179. The amino acid sequence of claim 177, wherein the Site 1 sequence 
consists essentially of a Formula 2 sequence X6X7X8X9X10X11X12X13, 
wherein X6 and X7 are aromatic amino acids, X8, X9, X11, and X12 are 
any amino acid, and X10 and X13 are hydrophobic amino acids. 

180. The amino acid sequence of claim 177, wherein the Site 2 sequence 
consists essentially of a Formula 6 sequence X62 X63 X64 X65 X66 X67 
X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 X80 X81, wherein X62, 
X65, X66, X68, X69, X71, X73, X76, X77, X78, X80 and X81 are any amino 
acid; X63, X70, and X74 are hydrophobic amino acids; X64 is a polar 
amino acid; X67 and X75 are aromatic amino acids; and X72 and X79 
are cysteines. 
181. The method according to claim 29, wherein the amino acid 
sequence is selected from the group consisting of: S527-S546; S549, 
D551-S591; S594-S624; S626-S639; and S641-S648. 

182. The method according to claim 181, wherein the amino acid 
sequence is selected from the group consisting of S557 and S597. 

183. A method of increasing insulin receptor activity in mammalian 
cells comprising administering to said cells an amino acid sequence 
selected from the group consisting of: S527-S546; S549, D551-S591; 
S594-S624; S626-S639; and S641-S648 and a physiologically 
acceptable carrier, excipient, or diluent, in an amount sufficient to 
increase insulin receptor activity. 

184. The method according to claim 183, wherein the amino acid 
sequence is selected from the group consisting of S557 and S597. 

185. An amino acid sequence that is an insulin receptor agonist, which 
comprises a sequence selected from the group consisting of: S527- 
S546; S549, D551-S591; S594-S624; S626-S639; and S641-S648. 

186. The sequence according to claim 185, wherein the sequence is 
selected from the group consisting of S557 and S597. 

187. A method of treating diabetes, said method comprising 
administering to a patient in need of such treatment (i) a first amount 
of a first compound selected from the group consisting of: peptides 
S519, S520; S524, S527-S546, S549, D551-S591, S594-S624, 
S626-S639, and S641-S648 and (ii) a second amount of a second 
compound comprising a long-acting insulin analogue, wherein the 
first and second amounts together are effective for treating said 
diabetes. 

188. The method according to claim 187, wherein said first compound 
is selected from the group consisting of: S557 and S597. 
189. The method according to claim 187, wherein said long-acting 
insulin analogue is selected from the group consisting of 
LysB29(Nε-myristoyl)des(B30) human insulin, 
LysB29(Nε-tetradecanoyl)des(B30) human insulin, and 
B29-Nε-(N-lithocholyl-γ-glutamyl)-des(B30) human insulin. 
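
The residue constraints recited in claims 131, 132, and 145 can be read as position-by-position patterns. The following Python sketch is illustrative only and forms no part of the claims: it encodes those residue sets as regular expressions and checks them against sequences recited above (FYDWF, SEQ ID NO:1554; WLDQEWAWVQCEVYGRGCPS, SEQ ID NO:2056; HLCVLEELFWGASLFGYCSG, SEQ ID NO:1576; and S519, SEQ ID NO:2033). The names FORMULA_1, FORMULA_4, FORMULA_6, and site_orientation are hypothetical, and reading "any aromatic amino acid" at X36 as F, W, or Y is an assumption.

```python
import re

AA = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter residue codes
ANY = f"[{AA}]"              # an unconstrained position

# Formula 1 (X1..X5) with the residue sets recited in claim 131.
FORMULA_1 = re.compile("[FY][FY][DEGS][WYF][FY]")

# Formula 6 (X62..X81) with the residue sets recited in claim 132;
# X72 and X79 are cysteines, every other position is unconstrained.
FORMULA_6 = re.compile(
    ANY + "[LIMV]" + "[DE]" + ANY * 2 + "W" + ANY * 2 + "[VILM]"
    + ANY + "C" + ANY + "[VILM]" + "[YW]" + ANY * 3 + "C" + ANY * 2
)

# Formula 4 (X22..X41) with the residue sets recited in claim 145.
# Assumption: "any aromatic amino acid" at X36 is taken as F, W, or Y.
FORMULA_4 = re.compile(
    ANY + "[LIMV]" + "C" + ANY * 2 + "[EDNQ]" + ANY * 3 + "W" + "G"
    + ANY * 3 + "[FWY]" + ANY * 2 + "C" + ANY * 2
)


def site_orientation(peptide: str):
    """Return the order of the Formula 1 and Formula 6 motifs, or None."""
    site1 = FORMULA_1.search(peptide)
    site2 = FORMULA_6.search(peptide)
    if site1 is None or site2 is None:
        return None
    if site2.end() <= site1.start():
        return "Site 2 to Site 1"
    if site1.end() <= site2.start():
        return "Site 1 to Site 2"
    return None  # overlapping matches are not classified


if __name__ == "__main__":
    print(bool(FORMULA_1.fullmatch("FYDWF")))
    print(bool(FORMULA_6.fullmatch("WLDQEWAWVQCEVYGRGCPS")))
    print(bool(FORMULA_4.fullmatch("HLCVLEELFWGASLFGYCSG")))
    print(site_orientation("SLEEEWAQVECEVYGRGCPSGSLDESFYDWFERQLG"))
```

The patterns use the narrower residue sets of the dependent claims rather than the broader classes ("aromatic", "polar", "hydrophobic") of claims 130 and 143, because the membership of those classes is not restated in the claims themselves.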
Unsearchable claims 

A. Claims 1, 28, 56, 66, 79, 106, 116, 129, 156-163, 173, 175 and 177-180 drawn to insulin/IGF agonists or antagonists 
(pharmaceutical/use thereof) comprising a site 1 receptor binding peptide and a site 2 receptor binding peptide are unsearchable since 
the claims fail to provide peptide structure corresponding to the site 1 AND site 2 insulin/IGF receptor binding peptides necessary to 
construct a sequence search. 

B. Claims 164-167, 172 drawn to the use of "derivatives" of claims 79, 106, 116, 129, 169 to screen for insulin agonists 
(claims 79, 106, 116, 169) or antagonists (claim 129) are unsearchable since the full structure of the derivative libraries is unknown, 
since neither the starting peptide structure (for claims 79, 106, 116, 129) nor the "derivations" (for claims 79, 106, 116, 129) within 
the scope of the claims are claimed. 

C. Claims 187-188 drawn to a combination of a Markush group of peptides with a "long-acting insulin analogue" are unsearchable 
since the claim fails to provide any chemical structure (peptidic or non-peptidic) corresponding to the "long-acting insulin analogue"; 
nor is there any description regarding the functions that are analogous, the degree of analogous structure and/or function or the 
corresponding structure sufficient to characterize a compound as a "long-acting insulin analogue". 

D. Claims 57-65, 107-115, drawn to an insulin receptor agonist (plurality of site 1 peptide binders) consisting essentially of 
Formula 1, to increase insulin receptor activity, are unsearchable since: there is no upper limit on the overall length of the peptide 
sequence; there is no upper limit on the plurality of Formula 1 sequences; and there is no means of positionally relating the various 
plural sequences to each other. 



BOX II. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
general inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search fees 
must be paid. 

GP 1. Use of an insulin receptor antagonist (site 1 peptide binder - site 2 peptide binder; linked C-N terminus) consisting essentially 
of Formula 1 (site 1) - Formula 6 (site 2) to decrease insulin receptor activity: claims 2-14 (SEQ ID's 1554 and 2129 are the 1st 
appearing peptide sequences for Formula 1 and Formula 6, respectively). 

GP 2. Use of an insulin receptor antagonist (site 1 peptide binder - site 2 peptide binder; linked C-N terminus) consisting essentially of 
Formula 1 (site 1) - Formula 4 (site 2) to decrease insulin receptor activity: claims 15-27. 

GP 3. Use of an insulin receptor agonist (site 2 peptide binder - site 1 peptide binder: linked C-N terminus) consisting essentially of 
Formula 6 (site 1) - Formula 1 (site 2), to increase insulin receptor activity: claims 29-43 and 181-186. 

GP 4. Use of an insulin receptor agonist (site 2 peptide binder - site 1 peptide binder: linked C-N terminus) consisting essentially of 
Formula 4 (site 1) - Formula 1 (site 2), to increase insulin receptor activity: claims 44-55. 

GP 5. Use of an insulin receptor agonist (site 1 peptide binder - site 2 peptide binder: linked C-C terminus) consisting essentially of 
Formula 1 and Formula 6, respectively, to increase insulin receptor activity: claims 67-78. 

GP 6. Use of an insulin receptor agonist (site 1 peptide binder - site 2 peptide binder: linked N-N terminus) consisting essentially of 
Formula 1 and Formula 6, to increase insulin receptor activity: claims 67-78. 

GP 7. Insulin receptor antagonist (site 1 peptide binder - site 2 peptide binder: linked C-N terminus) consisting essentially of Formula 
1 (site 1) - Formula 6 (site 2): claims 130-142. 

GP 8. Insulin receptor antagonist (site 1 peptide binder - site 2 peptide binder: linked C-N terminus) consisting essentially of Formula 1 
(site 1) - Formula 4 (site 2): claims 143-155. 

GP 9. Insulin receptor agonist (site 2 peptide binder - site 1 peptide binder: linked C-N terminus) consisting essentially of Formula 6 
(site 1) - Formula 1 (site 2): claims 80-94. 

GP 10. Insulin receptor agonist (site 2 peptide binder - site 1 peptide binder: linked C-N terminus) consisting essentially of Formula 
4 (site 1) - Formula 1 (site 2): claims 95-105. 

GP 11. Insulin receptor agonist (site 1 peptide binder - site 2 peptide binder: linked C-C terminus) consisting essentially of Formula 1 
and Formula 6: claims 117-128 (in part). 

GP 12. Insulin receptor agonist (site 1 peptide binder - site 2 peptide binder: linked N-N terminus) consisting essentially of Formula 1 
and Formula 6: claims 117-128 (in part). 

GP 13. An insulin receptor agonist comprising SEQ ID NO:2033, pharmaceutical composition thereof and method of increasing insulin 
receptor activity: claims 168-171 (in part). 

GP 14. An insulin receptor agonist comprising SEQ ID NO:2034, pharmaceutical composition thereof and method of increasing insulin 
receptor activity: claims 168-171 (in part). 

GP 15. Use of an insulin-like growth factor (IGF) receptor agonist (site 2 peptide binder - site 1 peptide binder: linked C-N terminus) 
consisting essentially of Formula 6 (site 1) and Formula 1 (site 2), to increase IGF receptor activity: claim 174. 

GP 16. An insulin-like growth factor (IGF) receptor agonist (site 2 peptide binder - site 1 peptide binder: linked C-N terminus) 
consisting essentially of Formula 1 (site 1) and Formula 6 (site 2): claim 176. 

GP 17. A method of treating diabetes by administering a first peptide compound and a second long-acting insulin analogue 
compound: claim 189. 

The inventions listed as Groups (1-6) and (7-12), respectively, do not relate to a single general inventive concept under PCT 
Rule 13.1 because, under PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 
the prior art teaches insulin receptor agonists/antagonists; IGF-1 receptor agonists/antagonists within the scope of the presently 
claimed "generic" invention (e.g. site 1/2 peptide receptor binding peptides); and means for further screening of such agonists and 
antagonists. See Yoshida et al. Anal. Chem. Vol. 72, No. 1 (Jan 2000) pages 6-IJ (peptidic insulin receptor agonists/antagonists and 
means for screening); Bell et al. US 4,761,371 (antibody insulin receptor agonists/antagonists and means for screening); Jameson et 
al. WO 93/23067 (11/93) (IGF-1 peptide receptor agonists/antagonists and means for screening). Additionally, the peptide/protein 
compositions of Groups 1-17 lack the same or corresponding technical features since the peptides/proteins of the different groups are 
unrelated and/or structurally distinct due to differences in amino acid composition and/or length and/or means of attachment of the 
site 1 to site 2 peptides to each other. 

Election of Species: (Groups 1-12 and 17): 

1. This application contains claims directed to more than one species of the generic invention. These species are 
deemed to lack unity of invention because they are not so linked as to form a single general inventive concept under PCT Rule 13.1. 

Groups 1-12 require the following election of combination species, based on the following approximate number of different 
sequence IDs within Formulas 1, 4 and 6: 

Formula 1: approx. 950 sequences 
Formula 4: approx. 300 sequences 
Formula 6: approx. 225 sequences 

Formula 1 x 6 = 213,750 species 
Formula 1 x 4 = 285,000 species 

The Group 17 peptide species are based on the 118 peptide species of claim 187. 
The species are as follows: 

GP 1: SPECIES 1-213,750 Formula 1 x Formula 6 sequences 
GP 2: SPECIES 213,751-498,751 Formula 1 x Formula 4 sequences 
GP 3: SPECIES 498,752-712,502 Formula 6 x Formula 1 sequences 
GP 4: SPECIES 712,503-997,502 Formula 4 x Formula 1 sequences 
GP 5: SPECIES 997,503-1,211,253 Formula 1 x Formula 6 sequences 
GP 6: SPECIES 1,211,254-1,425,002 Formula 1 x Formula 6 sequences 
GP 7: SPECIES 1,425,003-1,638,752 Formula 1 x Formula 6 sequences 
GP 8: SPECIES 1,638,753-1,923,752 Formula 1 x Formula 4 sequences 
GP 9: SPECIES 1,923,753-2,137,502 Formula 6 x Formula 1 sequences 
GP 10: SPECIES 2,137,503-2,422,502 Formula 4 x Formula 1 sequences 
GP 11: SPECIES 2,422,503-2,636,252 Formula 1 x Formula 6 sequences 
GP 12: SPECIES 2,636,253-2,850,002 Formula 1 x Formula 6 sequences 
GP 17: SPECIES 2,850,003-2,850,120 (e.g. 118 peptide species of claim 187) 

2. The species listed above do not relate to a single general inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, the species lack the same or corresponding special technical features for the following reasons: the peptide/protein 
compositions of Groups 1-12 are unrelated and/or structurally distinct due to differences in amino acid composition and/or length 
and/or means of attachment of the site 1 to site 2 peptides to each other. 
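
The species arithmetic in the election of species above can be checked with a short Python sketch. It uses only the approximate sequence counts stated in the report (about 950, 300, and 225 sequences for Formulas 1, 4, and 6), the formula pairing listed for each group, and the 118 peptide species of claim 187. The variable and function names are hypothetical, and the strictly cumulative numbering is an assumption made for illustration; the range boundaries printed in the report differ from such a tally by one or two in places.

```python
# Approximate sequence counts per formula, as stated in the report.
FORMULA_COUNTS = {1: 950, 4: 300, 6: 225}

# Formula pairing for each of Groups 1-12, read from the species list.
GROUP_PAIRS = {
    1: (1, 6), 2: (1, 4), 3: (6, 1), 4: (4, 1),
    5: (1, 6), 6: (1, 6), 7: (1, 6), 8: (1, 4),
    9: (6, 1), 10: (4, 1), 11: (1, 6), 12: (1, 6),
}
GROUP_17_SPECIES = 118  # peptide species of claim 187


def species_ranges():
    """Return {group: (first, last)} under a strictly cumulative numbering."""
    ranges = {}
    next_id = 1
    for group, (a, b) in GROUP_PAIRS.items():
        n = FORMULA_COUNTS[a] * FORMULA_COUNTS[b]  # pairwise combinations
        ranges[group] = (next_id, next_id + n - 1)
        next_id += n
    ranges[17] = (next_id, next_id + GROUP_17_SPECIES - 1)
    return ranges


if __name__ == "__main__":
    for group, (first, last) in species_ranges().items():
        print(f"GP {group}: species {first:,}-{last:,}")
```

Under these assumptions each Formula 1 x Formula 6 group contributes 950 x 225 = 213,750 species and each Formula 1 x Formula 4 group contributes 950 x 300 = 285,000 species, which matches the two products stated in the report.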